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NOVEL MEMBRANE PROTEINS AND POLYNUCLEOTIDES ENCODING THE SAME 

DESCRIPTION OF THE INVENTION 

[001] The present application claims priority to U.S. Provisional Application 
Number 60/199,950, which was filed April 27. 2000. The entire contents of U.S. 
Provisional Application Number 60/199,950 are Incorporated by reference herein for any 
purpose. 

Field of the Invention 

[002] The present invention relates to the discovery, identification and 
characterization of novel human polynucleotides that encode membrane associated 
proteins and receptors. The invention encompasses the described polynucleotides, 
host cell expression systems, the encoded proteins, fusion proteins, polypeptides and 
peptides, antibodies to the encoded proteins and peptides, and genetically engineered 
animals that lack the disclosed genes, or over express the disclosed genes, or 
antagonists and agonists of the proteins, and other compounds that modulate the 
expression or activity of the proteins encoded by the disclosed genes that can be used 
for diagnosis, drug screening, clinical trial monitoring, and/or the treatment of 
physiological or behavioral disorders. 

Background of the Invention 

[003] Membrane receptor proteins can serve as integral components of cellular 
mechanisms for sensing their environment, and maintaining cellular homeostasis and 
function. Accordingly, membrane receptor proteins are often involved in signal 
transduction pathways that control cell physiology, chemical communication, and gene 
expression. A particularly relevant class of membrane receptors are those typically 
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characterized by the presence of 7 conserved transmembrane domains that are 
interconnected by nonconserved hydrophilic loops. Such, "7TM receptors" include a 
superfamily of receptors known as G-protein coupled receptors (GPCRs). GPCRs are 
typically involved in signal transduction pathways involving G-proteins or PPG proteins. 
As such, the GPCR family includes many receptors that are known to serve as drug 
targets for therapeutic agents. 

SUMMARY OF THE INVENTION 

[004] The present invention relates to the discovery, identification, and 
characterization of nucleotides that encode novel GPCRs, and the corresponding novel 
GPCR (NGPCR) amino acid sequences. The NGPCRs described for the first time 
herein, are transmembrane proteins that span the cellular membrane and are Involved 
in signal transduction after ligand binding. The described NGPCRs have structural 
motifs found in the 7TM receptor family. Expression of the described NGPCRs can be 
detected in human placenta, bone marrow, trachea, testis, liver, and kidney cells. The 
novel human GPCR sequences described herein encode proteins of 769, 296, 848, 
481, 423, 560, 502, 241, and 320 amino acids in length (see respectively SEQ ID NOS: 
2, 4, 6, 8, 10, 12, 14, 16, and 18). The described NGPCRs have multiple 
transmembrane regions (of about 20-30 amino acids) characteristic of 7TM proteins as 
well as several predicted cytoplasmic domains. 

[005] Additionally contemplated are "knockout" ES cells that have been 
engineered using conventional methods (see, for example, PCT Application No. 
PCT/US98/03243, filed February 20, 1998, herein incorporated by reference). A gene 
trapped knockout ES cell line has been produced, targeting a murine homolog of the 
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described human sequences. Accordingly, an additional aspect of the present invention 
includes knockout cells and animals having genetically engineered mutations in the 
gene encoding the presently described NGPCRs. 

[006] The invention encompasses the nucleotide sequences presented in the 
Sequence Listing, host cells expressing such nucleotide sequences, and the expression 
products of such nucleotide sequences, and: (a) nucleotide sequences that encode 
mammalian homologs of the described NGPCRs, including the specifically described 
human NGPCRs, and the human NGPCR gene products; (b) nucleotide sequences that 
encode one or more portions of the NGPCRs that correspond to functional domains, 
and the polypeptide products specified by such nucleotide sequences, including but not 
limited to the novel regions of the described extracellular domain(s) ECD, one or more 
transmembrane domain(s) (TM) first disclosed herein, and the cytoplasmic domain(s) 
(CD); (c) isolated nucleotide sequences that encode mutants, engineered or naturally 
occurring, of the described NGPCRs in which all or a part of at least one of the domains 
is deleted or altered, and the polypeptide products specified by such nucleotide 
sequences, including but not limited to soluble receptors in which all or a portion of the 
TM is deleted, and nonfunctional receptors in which all or a portion of the CD is deleted; 
and (d) nucleotides that encode fusion proteins containing the codingTegion from an 
NGPCR, or one of its domains (e.g., an ECD) fused to another peptide or polypeptide. 

[007] The invention also encompasses agonists and antagonists of the 
NGPCRs, including small molecules, large molecules, mutant NGPCR proteins, or 
portions thereof that compete with the native NGPCR, and antibodies, as well as 
nucleotide sequences that can be used to inhibit the expression of the described 
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NGPCR (e.g., antisense and ribozyme molecules, and gene or regulatory sequence 
replacement constmcts) or to enhance the expression of the described NGPCR gene 
(e.g., expression constructs that place the described gene under the control of a strong 
promoter system), and transgenic animals that express a NGPCR transgene or "knock- 
outs" that do not express a functional NGPCR. 

[008] Further, the present invention also relates to methods for the use of the 
described NGPCR gene and/or NGPCR gene products for the identification of 
compounds that modulate, /.e., act as agonists or antagonists, of NGPCR gene 
expression and or NGPCR gene product activity. Such compounds can be used as 
therapeutic agents for the treatment of various symptomatic representations of 
biological disorders or imbalances. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS 

[009] The Sequence Listing provides the sequences of the certain NGPCR 
ORFs, the amino acid sequences encoded thereby, as well as an ORF with surrounding 
5' and 3' regions (SEQ ID NO: 19). 

DESCRIPTION OF EMBODIMENTS 

[010] The human NGPCRs, described for the first time herein, are novel 
receptor proteins that are expressed in human cells. The human NGPCR sequences 
were obtained using sequences from gene trapped human cells and cDNA clones 
isolated from human trachea, liver, and testis cDNA libraries (Edge Biosystems, 
Gaithersburg, MD, and Clontech, Palo Alto, CA). The described NGPCRs are 
transmembrane proteins that fall within the 7TM family of receptors. As with other 
GPCRs, signal transduction is triggered when a ligand binds to the receptor. Interfering 
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with the binding of the natural ligand, or neutralizing or removing the ligand, or 
interfering with its binding to a NGPCR will affect NGPCR mediated signal transduction. 
Because of their biological significance, 7TM, and particularly GPCR, proteins have 
been subjected to intense scientific/commercial scrutiny (see, for example, U.S. 
Application Ser, Nos. 08/820,521, filed March 19. 1997, and 08/833,226, filed April 17, 
1997 both of which are herein incorporated by reference in their entirety for applications, 
uses, and assays involving the described NGPCRs). In addition to 7TM proteins, the 
presently described NGPCRs share significant homology with a domain of the 
mammalian TRK-fused gene, latrophilin (latrotoxin receptor), and peptide hormone 
receptors. 

[Oil] The invention encompasses the use of the described NGPCR 
nucleotides, NGPCR proteins and peptides, as well as antibodies, preferably 
humanized monoclonal antibodies, or binding fragments, domains, or fusion proteins 
thereof, to the NGPCRs (which can, for example, act as NGPCR agonists or 
antagonists), antagonists that inhibit receptor activity or expression, or agonists that 
activate receptor activity or increase its expression in the diagnosis and treatment of 
disease. 

[012] In particular, the invention described In the subsections below 
encompasses NGPCR polypeptides or peptides corresponding to functional domains of 
NGPCR (e.g., ECD, TM or CD), mutated, truncated or deleted NGPCRs (e.g., NGPCRs 
missing one or more functional domains or portions thereof, such as, AECD, ATM 
and/or ACD), NGPCR fusion proteins (e.g., a NGPCR or a functional domain of a 
NGPCR, such as the ECD, fused to an unrelated protein or peptide such as an 
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immunoglobulin constant region, /.e., IgFc), nucleotide sequences encoding such 
products, and host cell expression systems that can produce such NGPCR products. 

[013] The invention also encompasses antibodies and anti-idiotypic antibodies 
(including Fab fragments), antagonists and agonists of the NGPCR, as well as 
compounds or nucleotide constructs that inhibit expression of a NGPCR gene 
(transcription factor inhibitors, antisense and ribozyme molecules, or gene or regulatory 
sequence replacement constructs), or promote expression of NGPCR (e.g., expression 
constructs in which NGPCR coding sequences are operatively associated with 
expression control elements such as promoters, promoter/enhancers, etc.). The 
invention also relates to host cells and animals genetically engineered to express the 
human NGPCRs (or mutants thereof) or to inhibit or "knock-out" expression of the 
animal's endogenous NGPCR genes. 

[014] The NGPCR proteins or peptides. NGPCR fusion proteins, NGPCR 
nucleotide sequences, antibodies, antagonists and agonists can be useful for the 
detection of mutant NGPCRs or inappropriately expressed NGPCRs for the diagnosis of 
disease. The NGPCR proteins or peptides, NGPCR fusion proteins, NGPCR nucleotide 
sequences, host cell expression systems, antibodies, antagonists, agonists and 
genetically engineered cells and animals can be used for screening for drugs (or high 
throughput screening of combinatorial libraries) effective in the treatment of the 
symptomatic or phenotypic manifestations of perturbing the normal function of NGPCR 
in the body. The use of engineered host cells and/or animals may offer an advantage in 
that such systems allow not only for the identification of compounds that bind to an ECD 
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. of a NGPCR. but can also identify compounds that affect the signal transduced by an 
activated NGPCR. 

[015] According to certain embodiments, the nucleotide sequences 
encompassed by the invention can be useful for chromosome mapping. For example, 
the nucleotide sequences as set forth in SEQ ID NOs 1, 3, 5, 7, 9, 11. 13. 15, 17, and 
19 are found on chromosome 3 at 3q1 1.1 in the human genome. In certain 
embodiments, these sequences can act as highly specific probes to show which regions 
of the chromosome actually code for protein. Thus, in certain embodiments, these 
sequences can provide mapping information for protein coding regions within the 
3q1 1.1 location of chromosome 3. In certain embodiments, these sequences allow for 
the identification of exons and the verification of splice junction sites. In certain 
embodiments, these sequences also can allow for the identification of the genomic 
locations for the NGPCR gene in mouse. This Information is useful for creating "knock- 
out" mice in which the expression of this protein is abrogated. 

[016] Finally, the NGPCR protein products (especially soluble derivatives such 
as peptides con^esponding to the NGPCR ECD, or truncated polypeptides lacking one 
or more TM domains) and fusion protein products (especially NGPCR-lg fusion 
proteins, /.e., fusions of a NGPCR, or a domain of a NGPCR. e.g., ECD, ATM to an 
IgFc), antibodies and anti-id iotypic antibodies (including Fab fragments), antagonists or 
agonists (including compounds that modulate signal transduction which may act on 
downstream targets in a NGPCR-mediated signal transduction pathway) can be used 
for therapy of such diseases. For example, the administration of an effective amount of 
soluble NGPCR ECD, ATM, or an ECD-lgFc fusion protein or an antl-ldiotypic antibody 
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(or its Fab) that mimics the NGPCR ECD would "mop up" or "neutralize" the 
endogenous NGPCR ligand, and prevent or reduce binding and receptor activation. 
Nucleotide constructs encoding such NGPCR products can be used to genetically 
engineer host cells to express such products in vivo; these genetically engineered cells 
function as "bioreactors" in the body delivering a continuous supply of a NGPCR, a 
NGPCR peptide, soluble ECD or ATM or a NGPCR fusion protein that will "mop up" or 
neutralize a NGPCR ligand. Nucleotide constructs encoding functional NGPCRs, 
mutant NGPCRs, as well as antisense and ribozyme molecules can be used in "gene 
therapy" approaches for the modulation of NGPCR expression. Thus, the invention also 
encompasses pharmaceutical formulations and methods for treating biological 
disorders. 

[01 7] Various aspects of the invention are described in greater detail in the 
subsections below. 

THE NGPCR POLYNUCLEOTIDES 
[018] The cDNA sequences and deduced amino acid sequences of certain 

described human NGPCRs are presented in the Sequence Listing. Several 

polymorphisms were identified during the sequencing of the described NGPCRs 

including: a C or T transition at the positions represented by, for example, nucleotide 

1075 of SEQ ID N0:1 and nucleotide 211 of SEQ ID N0:9 both of which can results in a 

glutamlne (preferred) or a stop codon being present at the corresponding amino acid 

positions (for example, 359 or 71 respectively)of the encoded proteins. 

[019] The NGPCRs of the present invention include: (a) the human DNA 

sequences presented in the Sequence Listing and any additionally contemplated 
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nucleotide sequence encoding a contiguous and functional NGPCR open reading frame 
(ORF) that hybridizes to a complement of the DNA sequences presented in the 
Sequence Listing under highly stringent conditions, e.g., hybridization to filter-bound 
DNA in 0.5 M NaHP04, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and 
washing in 0.1xSSC/0.1% SDS at 68°C (Ausubel P.M. etaL, eds., 1989, Current 
Protocols in Molecular Biology, Vol, I, Green Publishing Associates, Inc., and John 
Wiley & sons. Inc., New York, at p. 2.10.3) and encodes a functionally equivalent gene 
product. Additionally contemplated are any nucleotide sequences that hybridize to the 
complement of the DNA sequences that encode and express an amino acid sequence 
presented in the Sequence Listing under moderately stringent conditions, e.g., washing 
in 0.2xSSC/0.1% SDS at 42°G (Ausubel et al., 1989, supra), yet which still encode a 
functionally equivalent NGPCR gene product. Functional equivalents of NGPCR 
include naturally occurring NGPCRs present in other species, and mutant NGPCRs 
whether naturally occurring or engineered. The invention also includes degenerate 
variants of the disclosed sequences. 

[020] Additionally contemplated are polynucleotides encoding NGPCR ORFs, 
or their functional equivalents, encoded by polynucleotide sequences that are about 99, 
95, 90, or about 85 percent similar or identical to corresponding regions of the 
polynucleotide sequences described in the Sequence Listing (as measured by BLAST 
sequence comparison analysis using, for example, the GCG sequence analysis 
package using default parameters). 

[021] In certain embodiments, the invention includes nucleic acid molecules 
that comprise at least 300 contiguous bases of a human NGPCR nucleotide sequence. 
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in certain embodiments, tiie invention also includes nucleic acid molecules, preferably 
DNA molecules, that hybridize to, and are therefore the complements of, the described 
NGPCR nucleotide sequences. Such hybridization conditions may be highly stringent or 
moderately stringent (less highly stringent), as described above. In instances wherein 
the nucleic acid molecules are deoxyoligonucleotides ("DNA oligos"), such molecules 
(in certain embodiments, those about 16 to about 100 base long, about 20 to about 80, 
or about 34 to about 45 base long, or those of at least 300 nucleotides in length, or any 
variation or combination of sizes represented therein incorporating a contiguous region 
of sequence first disclosed in the present Sequence Listing) can be used in conjunction 
with the polymerase chain reaction (PGR) to screen libraries, isolate clones, and 
prepare cloning and sequencing templates, etc. 

[022] Alternatively, the oligonucleotides can be used singly or in chip format as 
hybridization probes. For example, a series of the described NGPCR oligonucleotide 
sequences, or the complements thereof, can be used to represent all or a portion of the 
described NGPCRs. The oligonucleotides, typically between about 16 to about 40 (or 
any whole number within the stated range) nucleotides in length may partially overlap 
each other and/or the NGPCR sequence may be represented using oligonucleotides 
that do not overiap. Accordingly, the described NGPCR polynucleotide sequences shall 
typically comprise at least about two or three distinct oligonucleotide sequences of at 
least about 18 nucleotides in length that are each first disclosed in the described 
Sequence Listing. Such oligonucleotide sequences may begin at any nucleotide 
present within a sequence in the Sequence Listing and proceed in either a sense (5'-to- 
3') orientation vis-a-vis the described sequence or in an antisense orientation. For 



10 



wo 01/81411 



PCTAJSOl/13371 



oligonucleotides probes, highly stringent conditions may refer, e.g.. to washing in 
6xSSC/0.05% sodium pyrophosphate at 37°C (for 14-base oligos). 48°C (for 17-base 
oligos), 55°C (for 20-base oligos), and 60°C (for 23-base oligos). 

[023] The described oligonucleotides may encode or act as NGPCR antisense 
molecules, useful, for example, in NGPCR gene regulation (for and/or as antisense 
primers in amplification reactions of NGPCR gene nucleic acid sequences). With 
respect to NGPCR gene regulation, such techniques can be used to regulate biological 
functions. Further, such sequences may be used as part of ribozyme and/or triple helix 
sequences, also useful for NGPCR gene regulation. 

[024] Additionally, the antisense oligonucleotides may comprise at least one 
modified base moiety which is selected from the group including, but not limited to, 
5-fluorouracil, 5-bromouracll, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 

4- acetylcytoslne, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D- 
galactosylqueosine. inosine, N6-isopentenyladenine. 1-methylguanine, 1-methylinosine, 
2.2-dimethylguanine, 2-methyladenin.e, 2-methylguanine, 3-methylcytosine, 

5- methylcytosine. N6-adenine. 7-methylguanine, 5-methylaminomethyluracil, 
5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 
5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, 
uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil. queosine, 2-thiocytosine, 
5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2- 
carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

11 
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[025] The antisense oligonucleotide may also comprise at least one modified 
sugar moiety selected from the group including but not limited to arabinose, 
2-fluoroarabinose, xylulose, and hexose. 

[026] In yet another embodiment, the antisense oligonucleotide comprises at 
least one modified phosphate backbone selected from the group consisting of a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, 
a phosphordiamidate, a methylphosphonate, an alkyi phosphotriester, and a formacetal 
or analog thereof. 

[027] In yet another embodiment, the antisense oligonucleotide is an 
a-anomeric oligonucleotide. An a-anomeric oligonucleotide forms specific double- 
stranded hybrids with complementary RNA in which, contrary to the usual p-units, the 
strands run parallel to each other (Gautler ef aA, 1987, Nucl. Acids Res. -/ 5:6625-6641). 
The oligonucleotide is a 2'-0-methylribonucleotide (Inoue ef a/., 1987, Nucl. Acids Res. 
^5:6131-6148), or a chimeric RNA-DNA analogue (Inoue etaL, 1987, FEBS Lett. 
2^5:327-330). 

[028] Oligonucleotides of the invention may be synthesized by standard 
methods known In the art, e.g. by use of an automated DNA synthesizer (such as are 
commercially available from Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides may be synthesized by the method of Stein et al. 
(1988, Nucl, Acids Res. Y6:3209), methylphosphonate oligonucleotides can be prepared 
by use of controlled pore glass polymer supports (Sarin et ai, 1988, Proc. Natl. Acad. 
Sci. U.S.A. 85:7448-7451). etc. 
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[029] Low stringency conditions are well known to tliose of skill in the art, and 
will vary predictably depending on the specific organisms from which the library and the 
labeled sequences are derived. For guidance regarding such conditions see, for 
example, Sambrook et a/., 1989, Molecular Cloning, A Laboratory Manual (and periodic 
updates thereof), Cold Springs Harbor Press, N.Y.; and Ausubel ef a/., 1989, Current 
Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience. 
N.Y, 

[030] Alternatively, suitably labeled NGPCR nucleotide probes may be used to 
screen a human genomic library using appropriately stringent conditions or by PCR. 
The identification and characterization of human genomic clones is helpful for identifying 
polymorphisms, determining the genomic structure of a given locus/allele, and 
designing diagnostic tests. For example, sequences derived from regions adjacent to 
the intron/exon boundaries of the human gene can be used to design primers for use in 
amplification assays to detect mutations within the exons, introns, splice sites (e.g., 
splice acceptor and/or donor sites), etc., that can be used in diagnostics and 
pharmacogenomics. 

[031] Further, a NGPCR gene homolog may be Isolated from nucleic acid of 
the organism of interest by performing PCR using two degenerate oligonucleotide 
primer pools designed on the basis of amino acid sequences within the NGPCR gene 
product disclosed herein. The template for the reaction may be total RNA, mRNA, 
and/or cDNA obtained by reverse transcription of mRNA prepared from, for example, 
human or non-human cell lines or tissue known or suspected to express a NGPCR 
gene allele. 
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[032] The PGR product may be subcloned and sequenced to ensure that the 
amplified sequences represent the sequence of the desired NGPCR gene. The PGR 
fragment may then be used to isolate a full length cDNA clone by a variety of methods. 
For example, the amplified fragment may be labeled and used to screen a cDNA library, 
such as a bacteriophage cDNA library. Alternatively, the labeled fragment may be used 
to isolate genomic clones via the screening of a genomic library, 

[033] PGR technology may also be utilized to isolate full length cDNA 
sequences. For example, RNA may be isolated, following standard procedures, from 
an appropriate cellular or tissue source (/.e., one known, or suspected, to express a 
NGPGR gene). A reverse transcription (RT) reaction may be perfomried on the RNA 
using an oligonucleotide primer specific for the most 5' end of the amplified fragment for 
the priming of first strand synthesis. The resulting RNA/DNA hybrid may then be 
"tailed" using a standard terminal transferase reaction, the hybrid may be digested with 
RNase H, and second strand synthesis may then be primed with a complementary 
primer. Thus, cDNA sequences upstream of the amplified fragment may easily be 
isolated. For a review of cloning strategies which may be used, see e.g., Sambrook et 
aL, 1989, supra. 

[034] A cDNA of a mutant NGPGR gene can be isolated, for example, by using 
PGR. In this case, the first cDNA strand may be synthesized by hybridizing an oligo-dT 
oligonucleotide to mRNA isolated from tissue known or suspected to be expressed in an 
individual putatively carrying a mutant NGPGR allele, and by extending the new strand 
with reverse transcriptase. The second strand of the cDNA is then synthesized using 
an oligonucleotide that hybridizes specifically to the 5' end of the normal gene. Using 
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these two primers, the product is then amplified via PCR, optionally cloned into a 
suitable vector, and subjected to DNA sequence analysis through methods well known 
to those of skill in the art. By comparing the DNA sequence of the mutant NGPCR 
allele to that of the normal NGPCR allele, the mutation(s) responsible for the loss or 
alteration of function of the mutant NGPCR gene product can be ascertained. 

[035] Altematively, a genomic library can be constructed using DNA obtained 
from an individual suspected of or known to carry the mutant NGPCR allele, or a cDNA 
library can be constructed using RNA from a tissue known, or suspected, to express the 
mutant NGPCR allele. A normal NGPCR gene, or any suitable fragment thereof, can 
then be labeled and used as a probe to identify the corresponding mutant NGPCR allele 
in such libraries. Clones containing the mutant NGPCR gene sequences can then be 
purified and subjected to sequence analysis according to methods well known to those 
of skill in the art. 

[036] Additionally, an expression library can be constructed utilizing cDNA 
synthesized from, for example, RNA Isolated from a tissue known, or suspected, to 
express a mutant NGPCR allele in an Individual suspected of or known to carry such a 
mutant allele. In this manner, gene products made by the putatively mutant tissue may 
be expressed and screened using standard antibody screening techniques in 
conjunction with antibodies raised against the normal NGPCR gene product, as 
described, below, in Section 5.3. (For screening techniques, see, for example, Harlow, 
E. and Lane, eds., 1988, "Antibodies: A Laboratory Manual", Cold Spring Harbor Press, 
Cold Spring Harbor) 
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[037] Additionally, screening can be accomplished by screening with labeled 
NGPCR fusion proteins, such as, for example, AP-NGPCR or NGPCR-AP fusion 
proteins. In cases where a NGPCR mutation results in an expressed gene product with 
altered function (e.g., as a result of a missense or a frameshift mutation), a polyclonal 
set of antibodies to NGPCR are likely to cross-react with the mutant NGPCR gene 
product. Library clones detected via their reaction with such labeled antibodies can be 
purified and subjected to sequence analysis according to methods well known to those 
of skill in the art. 

[038] The invention also encompasses nucleotide sequences that encode 
mutant NGPCRs. peptide fragments of the NGPCRs, truncated NGPCRs, and NGPCR 
fusion proteins. These include, but are not limited to, nucleotide sequences encoding 
mutant NGPCRs described below; polypeptides or peptides corresponding to one or 
more ECD. TM and/or CD domains of the NGPCR or portions of these domains; 
tmncated NGPCRs in which one or two of the domains is deleted, e.g., a soluble 
NGPCR lacking the TM or both the TM and CD regions, or a truncated, nonfunctional 
NGPCR lacking all or a portion of the CD region. Nucleotides encoding fusion proteins 
may include, but are not limited to, full length NGPCR sequences, truncated NGPCRs, 
or nucleotides encoding peptide fragments of NGPCR fused to an unrelated protein or 
peptide, such as for example, a transmembrane sequence, which anchors the NGPCR 
ECD to the cell membrane; an IgFc domain which increases the stability and half life of 
the resulting fusion protein (e.g., NGPCR-lg) in the bloodstream; or an enzyme, 
fluorescent protein, luminescent protein which can be used as a marker. 
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[039] The invention also encompasses (a) DNA vectors that contain any of the 

foregoing NGPCR coding sequences and/or their complements (/.e., antisense); (b) 

DNA expression vectors that contain any of the foregoing NGPCR coding sequences 

operatively associated with a regulatory element that directs the expression of the 

coding sequences; and (c) genetically engineered host cells that contain any of the 

foregoing NGPCR coding sequences operatively associated with a regulatory element 

that directs the expression of the coding sequences in the host cell. As used herein. 

regulatory elements include but are not limited to inducible and non-inducible 

promoters, enhancers, operators and other elements known to those skilled in the art 

that drive and regulate expression. Such regulatory elements include but are not limited 

to the cytomegalovirus hCMV immediate early gene, regulatable. viral (particularly 

retroviral LTR promoters) the early or late promoters of SV40 adenovirus, the lac 

system, the tre system, the tet system, the TAC system, the TRC system, the major 

operator and promoter regions of phage A, the control regions of fd coat protein, the 

promoter for 3-phosphoglycerate kinase (PGK), the promoters of acid phosphatase, and 

the promoters of the yeast a-mating factors. 

NGPCR PROTEINS AND POLYPEPTIDES 
[040] NGPCR proteins, polypeptides and peptide fragments, mutated, 

truncated or deleted forms of the NGPCR and/or NGPCR fusion proteins can be 

prepared for a variety of uses, including but not limited to the generation of antibodies, 

as reagents in diagnostic assays, the identification of other cellular gene products 

related to a NGPCR, as reagents in assays for screening for compounds that can be 

used as pharmaceutical reagents useful in the therapeutic treatment of mental. 
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biological, or medical disorders (/.e., heartbeat rate, improper blood pressure, etc.) and 
disease. 

[041] The Sequence Listing discloses the amino acid sequences encoded by 
certain NGPCR genes. The NGPCRs have initiator methionines in DNA sequence 
contexts consistent with translation initiation sites, followed by hydrophobic signal 
sequences typical of membrane associated proteins. The sequence data presented 
herein indicate that alternatively spliced forms of the NGPCRs exist (which may or may 
not be tissue specific). 

[042] The NGPCR amino acid sequences of the invention include the 
nucleotide and amino acid sequences presented In the Sequence Listing as well as 
analogues and derivatives thereof. Further, corresponding NGPCR homologues from 
other species are encompassed by the invention. In fact, any NGPCR protein encoded 
by the NGPCR nucleotide sequences described above are within the scope of the 
invention, as are any novel polynucleotide sequences encoding all or any novel portion 
of an amino acid sequence presented in the Sequence Listing. The degenerate nature 
of the genetic code is well known, and, accordingly, each amino acid presented in the 
Sequence Listing, is generically representative of the well known nucleic acid "triplet" 
codon, or in many cases codons, that can encode the amino acid. As such, as 
contemplated herein, the amino acid sequences presented In the Sequence Listing, 
when taken together with the genetic code (see, for example, Table 4-1 at page 109 of 
"Molecular Cell Biology", 1986, J. Darnell etaL eds.. Scientific American Books, New 
York, NY, herein incorporated by reference) are generically representative of all the 
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various permutations and combinations of nucleic acid sequences that can encode such 
amino acid sequences. 

[043] The invention also encompasses proteins that are functionally equivalent 
to the NGPCR encoded by the described nucleotide sequences as judged by any of a 
number of criteria, including but not limited to the ability to bind a ligand for a NGPCR, 
the ability to affect an identical or complementary signal transduction pathway, a change 
in cellular metabolism {e.g,, ion flux, tyrosine phosphorylation, etc.) or a change in 
phenotype when the NGPCR equivalent is present in an appropriate cell type (such as 
the amelioration, prevention or delay of a biochemical, biophysical, or overt phenotype. 
Such functionally equivalent NGPCR proteins include but are not limited to additions or 
substitutions of amino acid residues within the amino acid sequence encoded by the 
NGPCR nucleotide sequences described above but which result in a silent change, thus 
producing a functionally equivalent gene product. For example, in certain 
embodiments, one can employ a conservative amino acid substitution or substitutions 
which may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphlpathic nature of the residues involved. 
For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, 
valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; 
positively charged (basic) amino acids include arginine, lysine, and histidine; and 
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. 

[044] While random mutations can be made to NGPCR DNA (using random 
mutagenesis techniques well known to those skilled in the art) and the resulting mutant 
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NGPCRs tested for activity, site-directed mutations of the NGPCR coding sequence can 
be engineered (using site-directed mutagenesis techniques well known to those skilled 
in the art) to generate mutant NGPCRs with increased function, e.g.. higher binding 
affinity for the target ligand, and/or greater signaling capacity; or decreased function, 
and/or decreased signal transduction capacity. One starting point for such analysis .is 
by aligning the disclosed human sequences with corresponding gene/protein sequences 
from, for example, other mammals in order to identify amino acid sequence motifs that 
are conserved between different species. Non-conservative changes can be 
engineered at variable positions to alter function, signal transduction capability, or both. 
Alternatively, where alteration of function is desired, deletion or non-conservative 
alterations of the conserved regions (/,e., identical amino acids) can be engineered. For 
example, deletion or non-conservative alterations (substitutions or insertions) of the 
various conserved transmembrane domains. 

[045] An additional application of the described NGPCR polynucleotide 
sequences is their use in the molecular mutagenesis/evolution of proteins that are at 
least partially encoded by the described novel sequences using, for example, 
polynucleotide shuffling or related methodologies. Such approaches are described in 
U.S. Patents Nos. 5,830,721 and 5,837,458 which are herein incorporated by reference 
in their entirety. 

[046] Additionally contemplated uses for the described sequences include the 
engineering of constitutively "on" variants for use in cell assays and genetically 
engineered animals using the methods and applications described in U.S. Patent 
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Applications Ser Nos. 60/110,906, 60/106,300. 60/094,879, and 60/121,851 all of which 
are herein incorporated by reference in their entirety. 

[047] Other mutations to the NGPCR coding sequence can be made to 
generate NGPCRs that are better suited for expression, scale up, etc. in the host cells 
chosen. For example, cysteine residues can be deleted or substituted with another 
amino acid in order to eliminate disulfide bridges; N-linked glycosylation sites can be 
altered or eliminated to achieve, for example, expression of a homogeneous product 
that is more easily recovered and purified from yeast hosts which are known to 
hyperglycosylate N-linked sites. To this end, a variety of amino acid substitutions at one 
or both of the first or third amino acid positions of any one or more of the glycosylation 
recognition sequences which occur in the ECD (N-X-S or N-X-T), and/or an amino acid 
deletion at the second position of any one or more such recognition sequences in the 
ECD will prevent glycosylation of the NGPCR at the modified tripeptide sequence. 
(See, e.g., IVIiyajima et al., 1986, EMBO J. 5(6):1 193-1 197). 

[048] Peptides corresponding to one or more domains of the NGPCR (e.g., 
ECD, TM, CD, etc.), truncated or deleted NGPCRs (e.g., NGPCR in which a ECD, TM 
and/or CD Is deleted) as well as fusion proteins in which a full length NGPCR, a 
NGPCR peptide, or truncated NGPCR is fused to an unrelated protein, are also within 
the scope of the invention and can be designed on the basis of the presently disclosed 
NGPCR nucleotide and NGPCR amino acid sequences. Such fusion proteins include 
but are not limited to IgFc fusions which stabilize the NGPCR protein or peptide and 
prolong half-life in vivo; or fusions to any amino acid sequence that allows the fusion 
protein to be anchored to the cell membrane, allowing an ECD to be exhibited on the 
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cell surface; or fusions to an enzyme, fluorescent protein, or luminescent protein which 
provide a marker function. 

[049] While the NGPCR polypeptides and peptides can be chemically 
synthesized (e.g., see Creighton, 1983, Proteins: Stmctures and Molecular Principles, 
W.H. Freeman & Co.. N.Y.), large polypeptides derived from a NGPCR and full length 
NGPCRs can be advantageously produced by recombinant DNA technology using 
techniques well known in the art for expressing nucleic acid containing NGPCR gene 
sequences and/or coding sequences. Such methods can be used to construct 
expression vectors containing a presently described NGPCR nucleotide sequences and 
appropriate transcriptional and translational control signals. These methods include, for 
example, in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
genetic recombination. See, for example, the techniques described in Sambrook et al.. 
1989, supra, and Ausubel et al., 1989, supra. Alternatively, RNA corresponding to all or 
a portion of a transcript encoded by a NGPCR nucleotide sequence may be chemically 
synthesized using, for example, synthesizers. See, for example, the techniques 
described in "Oligonucleotide Synthesis", 1984, Gait, M.J. ed., IRL Press, Oxford, which 
is incorporated by reference herein in its entirety. 

[050] A variety of host-expression vector systems may be utilized to express 
the NGPCR nucleotide sequences of the invention. Where the NGPCR peptide or 
polypeptide is a soluble derivative (e.g., NGPCR peptides corresponding to an ECD; 
truncated or deleted NGPCR in which a TM and/or CD are deleted) the peptide or 
polypeptide can be recovered from the culture, i.e., from the host cell in cases where 
the NGPCR peptide or polypeptide is not secreted, and from the culture media in cases 
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where the NGPCR peptide or polypeptide is secreted by the cells. However, such 
expression systems also encompass engineered host cells that express a NGPCR, or 
functional equivalent, in situ, i.e., anchored in the cell membrane. Purification or 
enrichment of NGPCR from such expression systems can be accomplished using 
appropriate detergents and lipid micelles and methods well known to those skilled in the 
art. However, such engineered host cells themselves may be used in situations where 
it is important not only to retain the structural and functional characteristics of the 
NGPCR, but to assess biological activity, e.g., in drug screening assays. 

[051] The expression systems that may be used for purposes of the invention 
include but are not limited to microorganisms such as bacteria (e.g., E. coli, B. subtilis) 
transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing NGPCR nucleotide sequences; yeast (e.g., 
Saccharomyces, Pichia) transfomned with recombinant yeast expression vectors 
containing NGPCR nucleotide sequences; insect cell systems infected with recombinant 
virus expression vectors (e.g., baculovirus) containing NGPCR sequences; plant cell 
systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic 
virus, CaMV; tobacco mosaic virus, TMV) or transfomied with recombinant plasmid 
expression vectors (e.g., Ti plasmid) containing NGPCR nucleotide sequences; or 
mammalian cell systems (e.g., COS, CHO, BHK, 293. 3T3) harboring recombinant 
expression constructs containing promoters derived from the genome of mammalian 
cells (e.g.,.metallothionein promoter) or from mammalian viruses (e.g., the adenovirus 
late promoter; the vaccinia virus 7.5K promoter). 
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[052] In bacterial systems, a number of expression vectors may be 
advantageously selected depending upon the use intended for the NGPCR gene 
product being expressed. For example, when a large quantity of such a protein is to be 
produced, for the generation of pharmaceutical compositions of NGPCR protein or for 
raising antibodies to a NGPCR protein, for example, vectors that direct the expression 
of high levels effusion protein products that are readily purified may be desirable. Such 
vectors include, but are not limited, to the E. coll expression vector pUR278 (Rutheret 
al., 1983, EMBO J. 2:1791), in which a NGPCR coding sequence may be ligated 
individually into the vector in frame with the lacZ coding region so that a fusion protein is 
produced; pIN vectors (Inouye & Inouye. 1985, Nucleic Acids Res. 13:3101-3109; Van 
Heeke & Schuster, 1989, J. Biol. Chem. 264:5503-5509); and the like. pGEX vectors 
may also be used to express foreign polypeptides as fusion proteins with glutathione 
S-transferase (GST). In general, such fusion proteins are soluble and can easily be 
purified from lysed cells by adsorption to glutathione-agarose beads followed by elution 
in the presence of free glutathione. The PGEX vectors are designed to include 
thrombin or factor Xa protease cleavage sites so that the cloned target gene product 
can be released from the GST moiety. 

[053] In an insect system, Autographa californica nuclear polyhidrosis virus 
(AcNPV) is used as a vector to express foreign genes. The virus grows in Spodoptera 
frugiperda cells. A NGPCR gene coding sequence may be cloned individually into non- 
essential regions (for example the polyhedrin gene) of the virus and placed under 
control of an AcNPV promoter (for example the polyhedrin promoter). Successful 
Insertion of NGPCR gene coding sequence will result In inactivation of the polyhedrin 
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gene and production of non-occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene). These recombinant viruses are 
then used to infect Spodoptera frugiperda cells in which the inserted gene is expressed 
(e.g.. see Smith et a!., 1983. J. Virol. 46: 584; Smith. U.S. Patent No. 4,215,051). In 
mammalian host cells, a number of viral-based expression systems may be utilized. In 
cases where an adenovirus is used as an expression vector, tine NGPCR nucleotide 
sequence of interest may be ligated to an adenovirus transcription/translation control 
complgx. e.g., the late promoter and tripartite leader sequence. This chimeric gene may 
then be inserted in the adenovirus genome by in vitro or in vivo recombination. Inser-. 
tion in a non-essential region of the viral genome (e.g., region El or E3) will result in a 
recombinant virus that is viable and capable of expressing a NGPCR gene product in 
infected hosts (e.g.. See Logan & Shenk, 1984, Proc. Natl. Acad. Sci. USA 81:3655- 
3659). Specific initiation signals may also be required for efficient translation of inserted 
NGPCR nucleotide sequences. These signals include the ATG initiation codon and 
adjacent sequences. In cases where an entire NGPCR gene or cDNA, including its own 
initiation codon and adjacent sequences, is inserted into the appropriate expression 
vector, no additional translationai control signals may be needed. However, in cases 
where only a portion of a NGPCR coding sequence is inserted, exogenous translationai 
control signals, including, perhaps, the ATG initiation codon. typically are provided. 
Furthermore, the initiation codon typically must be in phase with the reading frame of 
the desired coding sequence to ensure translation of the entire insert. These 
exogenous translationai control signals and initiation codons can be of a variety of 
origins, both natural and synthetic. The efficiency of expression may be enhanced by 
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the inclusion of appropriate transcription enhancer elements, transcription tenninators, 
etc. (See Bittner et al., 1987, Methods in Enzymol. 153:516-544). 

[054] In addition, a host cell strain may be chosen that modulates the 
expression of the inserted sequences, or modifies and processes the gene product in 
the specific fashion desired. Such modifications (e.g., glycosylation) and processing 
(e.g., cleavage) of protein products may be important for the function of the protein. 
Different host cells have characteristic and specific mechanisms for the post-transta- 
tional processing and modification of proteins and gene products. Appropriate cell lines 
or host systems can be chosen to ensure the correct modification and processing of the 
foreign protein expressed. To this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, glycosylation, and 
phosphorylation of the gene product may be used. Such mammalian host cells include, 
but are not limited to, CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, and WI38 cell 
lines. 

[055] For long-term, high-yield production of recombinant proteins, stable 
expression is prefen-ed. For example, cell lines which stably express the NGPCR 
sequences described above may be engineered. Rather than using expression vectors 
that contain viral origins of replication, host cells can be transformed with DNA 
controlled by appropriate expression control elements (e.g., promoter, enhancer 
sequences, transcription terminators, polyadenylation sites, etc.), and a selectable 
marker. Following the introduction of the foreign DNA, engineered cells may be allowed 
to grow for 1-2 days in an enriched media, and then are switched to a selective media. 
The selectable marker in the recombinant plasmid confers resistance to the selection 
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and allows cells to stably integrate the plasmid into their chromosonnes and grow to 
form foci which in turn can be cloned and expanded into cell lines. This method may 
advantageously be used to engineer cell lines which express the NGPCR gene product. 
Such engineered cell lines may be particularly useful in screening and evaluation of 
compounds that affect the endogenous acWity of the NGPCR gene product. 

[056] A number of selection systems can be used, including but not limited to 
the herpes simplex virus thymidine kinase (Wigler, et a!., 1977, Cell 11:223), 
hypoxanthine-guanine phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc, 
NatK Acad. Sci. USA 48:2026), and adenine phosphoribosyltransferase (Lowy, et al., 
1980, Cell 22:817) genes can be employed in tk-. hgprt- or aprt- cells, respectively. 
Also, antimetabolite resistance can be used as the basis of selection for the following 
genes: dhfr, which confers resistance to methotrexate (Wigler, et a!., 1980, Natl. Acad. 
Sci. USA 77:3567; O'Hare. et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which 
confers resistance to mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. 
USA 78:2072); neo, which confers resistance to the aminoglycoside G-418 (Colberre- 
Garapin, et al., 1981, J. Mol. Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et al., 1984, Gene 30:147). 

[057] Alternatively, any fusion protein can be readily purified by utilizing an 
antibody specific for the fusion protein being expressed. For example, a system 
described by Janknecht et al. allows for the ready purification of non-denatured fusion 
proteins expressed in human cell lines (Janknecht, et al., 1991, Proc. Natl. Acad. Sci. 
USA 88: 8972-8976). In this system, the gene of interest is subcloned into a vaccinia 
recombination plasmid such that the gene's open reading frame is translationally fused 
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to an amino-terminal tag having six histidine residues. Extracts from cells infected with 
recombinant vaccinia vims are loaded onto Ni2+»nitriloacetic acid-agarose columns and 
histidine-tagged proteins are selectively eluted with imidazole-containing buffers. 

[058] NGPCR gene products can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, worms, mice, rats, rabbits, guinea 
pigs, rodents, pigs, micro-pigs, birds, goats, farm animals, and non-human primates, 
e.g., baboons, monkeys, and chimpanzees may be used to generate NGPCR 
transgenic animals. 

[059] Any technique known in the art may be used to introduce a NGPCR 
transgene into animals to produce the founder lines of transgenic animals. Such 
techniques include, but are not limited to pronuciear microinjection (Hoppe, P.C. and 
Wagner, T.E., 1989, U.S. Pat. No. 4,873,191); retrovirus mediated gene transfer into 
germ lines (Van der Putten et aL, 1985, Proc. Natl. Acad. Sci,. USA 82:6148-6152); 
gene targeting in embryonic stem cells (Thompson et a!., 1989, Cell 56:313-321); 
electroporation of embryos (Lo, 1983. Mol Cell. Biol. 3:1803-1814); and sperm-mediated 
gene transfer (Lavitrano et al., 1989, Cell 57:717-723); etc. For a review of such 
techniques, see Gordon, 1989, Transgenic Animals, Intl. Rev. Cytol. 115:171-229, 
which is incorporated by reference herein in Jts entirety. 

[060] The present invention provides for transgenic animals that carry the 
NGPCR transgene in all their cells, as well as animals which carry the transgene in 
some, but not all their cells, i.e., mosaic animals or somatic cell transgenic animals. 
The transgene may be integrated as a single transgene or in concatamers, e.g., head- 
to-head tandems or head-to-tail tandems. The transgene may also be selectively 
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introduced into and activated In a particular cell type by following, for example, the 
teaching of Lasko et al., 1992, Proc. Natl. Acad. Sci. USA 89:6232-6236. The 
regulatory sequences required for such a cell-type specific activation will depend upon 
the particular cell type of interest, and will be apparent to those of skill in the art. 

[061] When it is desired that a NGPCR transgene be integrated into the 
chromosomal site of the endogenous NGPCR gene, gene targeting is preferred. Briefly, 
when such a technique is to be utilized, vectors containing some nucleotide sequences 
homologous to the endogenous NGPCR gene are designed for the purpose of 
integrating, via homologous recombination with chromosomal sequences, into and 
dismpting the function of the nucleotide sequence of the endogenous NGPCR gene 
(i.e., "knockout" animals). 

[062] The transgene can also be selectively introduced into a particular cell 
type, thus inactivating the endogenous NGPCR gene in only that cell type, by following, 
for example, the teaching of Gu et al., 1994, Science, 265:103-106. The regulatory 
sequences required for such a cell-type specific inactivation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. 

[063] Once transgenic animals have been generated, the expression of the 
recombinant NGPCR gene may be assayed utilizing standard techniques. Initial 
screening may be accomplished by Southern blot analysis or PCR techniques to 
analyze animal tissues to assay whether integration of the transgene has taken place. 
The level of mRNA expression of the transgene in the tissues of the transgenic animals 
may also be assessed using techniques which include but are not limited to Northem 
blot analysis of tissue samples obtained from the animal, in situ hybridization analysis, 
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and RT-PCR. Samples of NGPCR gene-expressing tissue, may also be evaluated 
immunocytochemically using antibodies specific for the NGPCR transgene product 

ANTIBODIES TO NGPCR PROTEINS 

[064] Antibodies that specifically recognize one or more epitopes of a NGPCR, 
or epitopes of conserved variants of a NGPCR, or peptide fragments of a NGPCR are 
also encompassed by the invention. Such antibodies include but are not limited to 
polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric 
antibodies, single chain antibodies. Fab fragments, F(ab')2 fragments, fragments 
produced by a Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitope- 
binding fragments of any of the above. 

[065] The antibodies of the invention may be used, for example, in the 
detection of NGPCR in a biological sample and. may, therefore, be utilized as part of a 
diagnostic or prognostic technique whereby patients may be tested for abnormal 
amounts of NGPCR, Such antibodies may also be utilized in conjunction with, for 
example, compound screening schemes, as described below, for the evaluation of the 
effect of test compounds on expression and/or activity of a NGPCR gene product. 
Additionally, such antibodies can be used in conjunction gene therapy to, for example, 
evaluate the normal and/or engineered NGPCR-expressing cells prior to their 
introduction into the patient. Such antibodies may additionally be used as a method for 
the inhibition of abnonnal NGPCR activity. Thus, such antibodies may, therefore, be 
utilized as part of weight disorder treatment methods. 

[066] For the production of antibodies, various host animals may be 
immunized by injection with the NGPCR, an NGPCR peptide (e.g., one corresponding 
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to a functional domain of the receptor, such as an ECD, TM or CD), truncated NGPCR 
polypeptides (NGPCR in which one or more domains, e.g., a TM or CD, has been 
deleted), functional equivalents of the NGPCR or mutants of the NGPCR. Such host 
animals may include but are not limited to rabbits, mice, and rats, to name but a few. 
Various adjuvants may be used to increase the immunological response, depending on 
the host species, including but not limited to Freund's (complete and incomplete), 
mineral gels such as aluminum hydroxide, surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, dinitrophenol, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and Corynebacterium parvum. Polyclonal antibodies are 
heterogeneous populations of antibody molecules derived from the sera of the 
immunized animals, 

[067] Monoclonal antibodies, which are homogeneous populations of 
antibodies to a particular antigen, may be obtained by any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique of Kohler and Milstein. (1975, Nature 
256:495-497; and U.S. Patent No. 4,376,110), the human B-cell hybridoma technique 
(Kosbor et a!., 1983, Immunology Today 4:72; Cole et al., 1983, Proc. Natl. Acad. Sci. 
USA 80:2026-2030), and the EBV-hybridoma technique (Cole et a!., 1985, Monoclonal 
Antibodies And Cancer Therapy, Alan R. Liss. Inc., pp. 77-96). Such antibodies may be 
of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. 
The hybridoma producing the mAb of this invention may be cultivated in vitro or in vivo. 
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Production of high titers of mAbs in vivo malces this the presently preferred method of 
production, 

[068] In addition, techniques developed for the production of "chimeric 
antibodies" (IVlorrison et al., 1984, Proc. Natl, Acad. Sci., 81:6851-6855; Neubergeret 
al., 1984, Nature, 312:604-608; Taked.a et al., 1985, Nature, 314:452-454) by splicing 
the genes from a mouse antibody molecule of appropriate antigen specificity together 
with genes from a human antibody molecule of appropriate biological activity can be 
used, A chimeric antibody is a molecule in which different portions are derived from 
different animal species, such as those having a variable region derived from a murine 
mAb and a human immunoglobulin constant region. 

[069] Alternatively, techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778; Bird, 1988, Science 242:423-426; Huston et aL, 
1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and Ward et al., 1989, Nature 334:544- 
546) can be adapted to produce single chain antibodies against NGPCR gene products. 
Single chain antibodies are formed by linking the heavy and light chain fragments of the 
Fv region via an amino acid bridge, resulting in a single chain polypeptide. 

[070] Antibody fragments that recognize specific epitopes may be generated 
by known techniques. For example, such fragments include but are not limited to: the 
F(ab')2 fragments which can be produced by pepsin digestion of the antibody molecule 
and the Fab fragments which can be generated by reducing the disulfide bridges of the 
F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed (Huse et 
al., 1989, Science, 246:1275-1281) to allow rapid and easy identification of monoclonal 
Fab fragments with the desired specificity. 
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[071] Antibodies to a NGPCR can, in turn, be utilized to generate anti-idiotype 

antibodies that "nnimic" a given NGPCR, using tecliniques well known to those skilled in 

the art. (See, e.g., Greenspan & Bona, 1993, FASEB J 7(5):437-444; and Nissinoff, 

1991 , J. Immunol. 147(8):2429-2438). For example antibodies which bind to a NGPCR 

ECD and competitively inhibit the binding of a ligahd of NGPCR can be used to 

generate anti-idiotypes that "mimic" a NGPCR ECD and, therefore, bind and neutralize 

a ligand. Such neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can 

be used in therapeutic regimens involving the NGPCR signaling pathway. 

DIAGNOSIS OF ABNORMALITIES RELATED TO A NGPCR 
[072] A variety of methods can be employed for the diagnostic and prognostic 

evaluation of disorders related to NGPCR function, and for the identification of subjects 

having a predisposition to such disorders. 

[073] Such methods can, for example, utilize reagents such as the NGPCR 
nucleotide sequences described in Section 5.1, and NGPCR antibodies, as described, 
in Section 5.3. Specifically, such reagents may be used, for example, for; (1) the 
detection of the presence of NGPCR gene mutations, or the detection of either over- or 
under-expression of NGPCR mRNA relative to a given phenotype; (2) the detection of 
either an over- or an under-abundance of NGPCR gene product relative to a given 
phenotype; and (3) the detection of perturbations or abnormalities in the signal 
transduction pathway mediated by NGPCR. 

[074] The methods described herein may be perfomied, for example, by 
utilizing pre-packaged diagnostic kits comprising at least one specific NGPCR 
nucleotide sequence or NGPCR antibody reagent described herein, which may be 
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conveniently used, e.g., in clinical settings, to diagnose patients exiiibiting body weight 
disorder abnormalities. 

[075] For the detection of NGPCR mutations, any nucleated cell can be used 
as a starting source for genomic nucleic acid. For the detection of NGPCR gene 
expression or NGPCR gene products, any cell type or tissue in which the NGPCR gene 
is expressed, such as, for example, stomach or brain cells can be utilized, 

[076] Nucleic acid-based detection techniques and peptide detection 

techniques are described below. 

DETECTION OF NGPCR GENES AND TRANSCRIPTS 
[077] Mutations within a NGPCR gene can be detected by utilizing a number of 

techniques. Nucleic acid from any nucleated cell can be used as the starting point for 

such assay techniques, and may be isolated according to standard nucleic acid 

preparation procedures which are well known to those of skill in the art. 

[078] DNA may be used in hybridization or amplification assays of biological 
samples to detect abnormalities involving NGPCR gene structure, including point 
mutations, insertions, deletions and chromosomal rearrangements. Such assays may 
include, but are not limited to. Southern analyses, single stranded conformational 
polymorphism analyses (SSCP), and PCR analyses. 

[079] Such diagnostic methods for the detection of NGPCR gene-specific 
mutations can involve for example, contacting and incubating nucleic acids including 
recombinant DNA molecules, cloned genes or degenerate variants thereof, obtained 
from a sample, e.g.. derived from a patient sample or other appropriate cellular source, 
with one or more labeled nucleic acid reagents including recombinant DNA molecules, 
cloned genes or degenerate variants thereof, as described in Section 5.1, under 
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conditions favorable for the specific annealing of these reagents to their complementary 
sequences within a given NGPCR gene. Preferably, the lengths of these nucleic acid 
reagents are at least 15 to 30 nucleotides. After incubation, all non-annealed nucleic 
acids are removed from the nucleic acid;NGPCR molecule hybrid. The presence of 
nucleic acids which have hybridized, if any such molecules exist, is then detected. 
Using such a detection scheme, the nucleic acid from the cell type or tissue of interest 
can be immobilized, for example, to a solid support such as a membrane, or a plastic 
surface such as that on a microtiter plate or polystyrene beads. In this case, after 
Incubation, non-annealed, labeled nucleic acid reagents of the type described in Section 
5.1 are easily removed. Detection of the remaining, annealed, labeled NGPCR nucleic 
acid reagents is accomplished using standard techniques well-known to those in the art. 
The NGPCR gene sequences to which the nucleic acid reagents have annealed can be 
compared to the annealing pattem expected from a normal NGPCR gene sequence in 
order to determine whether a NGPCR gene mutation is present. 

[080] Alternative diagnostic methods for the detection of NGPCR gene specific 
nucleic acid molecules, in patient samples or other appropriate cell sources, may 
involve their amplification, e.g., by PCR (the experimental embodiment set forth in 
Mullis, K.B,, 1987. U.S. Patent No. 4,683,202), followed by the detection of the amplified 
molecules using techniques well known to those of skill in the art. The resulting 
amplified sequences can be compared to those which would be expected if the nucleic 
acid being amplified contained only nonnal copies of a NGPCR gene in order to 
determine whether a NGPCR gene mutation exists. 
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[081] Additionally, well-known genotyping techniques can be performed to 
identify individuals carrying NGPCR gene mutations. Such techniques include, for 
example, the use of restriction fragment length polymorphisms (RFLPs), which involve 
sequence variations in one of the recognition sites for the specific restriction enzyme 
used. 

[082] Additionally, improved methods for analyzing DNA polymorphisms which 
can be utilized for the identification of NGPCR gene mutations have been described 
which capitalize on the presence of variable numbers of short, tandemly repeated DNA 
sequences between the restriction enzyme sites. For example, Weber (U.S. Pat. No. 
5.075,217. which is incorporated herein by reference in its entirety) describes a DNA 
marker based on length polymorphisms in blocks of (dC-dA)n-(dG-dT)n short tandem 
repeats. The average separation of (dC"dA)n-(dG-dT)n blocks is estimated to be 
30,000-60.000 bp. Markers which are so closely spaced exhibit a high frequency co- 
inheritance, and are extremely useful in the identification of genetic mutations, such as, 
for example, mutations within a given NGPCR gene, and the diagnosis of diseases and 
disorders related to NGPCR mutations. 

[083] Also, Caskey et al. (U.S. Pat. No. 5,364,759. which is incorporated 
herein by reference in its entirety) describe a DNA profiling assay for detecting short tri 
and tetra nucleotide repeat sequences. The process includes extracting the DNA of 
interest, such as the NGPCR gene, amplifying the extracted DNA, and labeling the 
repeat sequences to form a genotypic map of the individual's DNA. 

[084] The level of NGPCR gene expression can also be assayed by detecting 
and measuring NGPCR transcription. For example, RNA from a cell type or tissue 
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known, or suspected to express the NGPCR gene, such as brain, may be isolated and 
tested utilizing hybridization or PGR techniques such as are described, above. The 
isolated cells can be derived from cell culture or from a patient. The analysis of cells 
taken from culture may be a necessary step in the assessment of cells to be used as 
part of a cell-based gene therapy technique or, alternatively, to test the effect of 
compounds on the expression of the NGPCR gene. Such analyses may reveal both 
quantitative and qualitative aspects of the expression pattern of the NGPCR gene, 
including activation or inactivation of NGPCR gene expression. 

[085] In certain embodiments of such a detection scheme, cDNAs are 
synthesized from the RNAs of interest (e.g., by reverse transcription of the RNA 
molecule into cDNA). A sequence within the cDNA is then used as the template for a 
nucleic acid amplification reaction, such as a PGR amplification reaction, or the like. 
The nucleic acid reagents used as synthesis initiation reagents (e.g., primers) in the 
reverse transcription and nucleic acid amplificatron steps of this method can be chosen 
from among the NGPCR nucleic acid reagents described in Section 5.1 . The preferred 
lengths of such nucleic acid reagents are at least 9-30 nucleotides. For detection of the 
amplified product, the nucleic acid amplification may be perfomned using radioactively or 
non-radioactively labeled nucleotides. Altematively, enough amp^lified product may be 
made such that the product may be visualized by standard ethidium bromide staining, 
by utilizing any other suitable nucleic acid staining method, or by sequencing. 

[086] Additionally, it is possible to perform such NGPCR gene expression 
assays "in situ", i.e., directly upon tissue sections (fixed and/or frozen) of patient tissue 
obtained from biopsies or resections, such that no nucleic acid purification is necessary. 
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Nucleic acid reagents such as those described above may be used as probes and/or 

primers for such in situ procedures (See, for example. Nuovo, G.J., 1992, "PGR In Situ 

Hybridization: Protocols And Applications", Raven Press. NY). 

[087] Alternatively, if a sufficient quantity of the appropriate cells can be 

obtained, standard Northern analysis can be performed to determine the level of 

NGPCR mRNA expression. 

DETECTION OF NGPCR GENE PRODUCTS 
[088] Antibodies directed against wild type or mutant NGPCR gene products or 

conserved variants or peptide fragments thereof, which are discussed above, may also 

be used as diagnostics and prognostics, as described herein. Such diagnostic 

methods, may be used to detect abnormalities in the level of NGPCR gene expression, 

or abnormalities in the structure and/or temporal, tissue, cellular, or subcellular location 

of the NGPCR, and may be performed in vivo or in vitro, such as, for example, on 

biopsy tissue. 

[089] For example, antibodies directed to epitopes of the NGPCR ECD can be 
used in vivo to detect the pattern and level of expression of the NGPCR in the body. 
Such antibodies can be labeled, e.g., with a radio-opaque or other appropriate 
compound and injected into a subject in order to visualize binding to the NGPCR 
expressed in the body using methods such as X-rays, CAT-scans, or MRI. Labeled 
antibody fragments, e.g., the Fab or single chain antibody comprising the smallest 
portion of the antigen binding region, are preferred for this purpose to promote crossing 
the blood-brain barrier and permit labeling NGPCRs expressed in the brain. 

[090] Additionally, any NGPCR fusion protein or NGPCR conjugated protein 
whose presence can be detected, can be administered. For example, NGPCR fusion or 
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conjugated proteins labeled with a radio-opaque or other appropriate compound can be 
administered and visualized in vivo, as discussed, above for labeled antibodies. Further 
such NGPCR fusion proteins as AP-NGPCR on NGPCR-Ap fusion proteins can be 
utilized for in vitro diagnostic procedures. 

[091] Alternatively, immunoassays or fusion protein detection assays, as 
described above, can be utilized on biopsy and autopsy samples in vitro to permit 
assessment of the expression pattern of the NGPCR. Such assays are not confined to 
the use of antibodies that define a NGPCR ECD, but can include the use of antibodies 
directed to epitopes of any of the domains of a NGPCR, e.g., the ECD. the TM and/or 
CD. The use of each or all of these labeled antibodies will yield useful information 
regarding translation and intracellular transport of the NGPCR to the cell surface, and 
can identify defects in processing. 

[092] The tissue or cell type to be analyzed will generally include those 

which are known, or suspected, to express the NGPCR gene. The protein isolation 
methods employed herein may, for example, be such as those described in Harlow and 
Lane (Harlow, E. and Lane, D.. 1988, "Antibodies: A Laboratory Manual", Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York), which is incorporated herein 
by reference in its entirety. The isolated cells can be derived from cell culture or from a 
patient. The analysis of cells taken from culture may be a necessary step in the 
assessment of cells that could be used as part of a cell-based gene therapy technique 
or, alternatively, to test the effect of compounds on the expression of a NGPCR gene. 

[093] For example, antibodies, or fragments of antibodies, such as those 
described, above, in Section 5.3, useful in the present invention may be used to 



39 



wo 01/81411 



PCTAJSOl/13371 



quantitatively or qualitatively detect the presence of NGPCR gene products or 
conserved variants or peptide fragments thereof. This can be accomplished, for . 
example, by immunofluorescence techniques employing a fluorescently labeled 
antibody (see below, this Section) coupled with light microscopic, flow cytometric, or 
fluorimetric detection. Such techniques are especially preferred if such NGPCR gene 
products are expressed on the cell surface. 

[094] The antibodies (or fragments thereof) or NGPCR fusion or conjugated 
proteins useful in the present invention may, additionally, be employed histologically, as 
in immunofluorescence, immunoelectron microscopy or non-immuno assays, for In situ 
detection of NGPCR gene products or conserved variants or peptide fragments thereof, 
or for NGPCR binding (In the case of labeled NGPCR llgand fusion protein). 

[095] In situ detection may be accomplished by removing a histological 
specimen from a patient, and applying thereto a labeled antibody or fusion protein of the 
present invention. The antibody (or fragment) or fusion protein Is preferably applied by 
overlaying the labeled antibody (or fragment) onto a biological sample. Through the use 
of such a procedure, it is possible to determine not only the presence of a NGPCR gene 
product, or conserved variants or peptide fragments, or NGPCR binding, but also its 
distribution in the examined tissue. Using the present Invention, those of ordinary skill 
will readily perceive that any of a wide variety of histological methods (such as staining 
procedures) can be modified in order to achieve such in situ detection. 

[096] Immunoassays and non-immunoassays for NGPCR gene products or 
conserved variants or peptide fragments thereof will typically comprise incubating a 
sample, such as a biological fluid, a tissue extract, freshly harvested cells, or lysates of 
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cells which have been incubated in cell culture, in the presence of a detectably labeled 
antibody capable of identifying NGPCR gene products or conserved variants or peptide 
fragments thereof, and detecting the bound antibody by any of a number of techniques 
well-known in the art. 

[097] The biological sample may be brought in contact with and immobilized 
onto a solid phase support or carrier such as nitrocellulose, or other solid support which 
is capable of immobilizing cells, cell particles or soluble proteins. The support may then 
be washed with suitable buffers followed by treatment with the detectably labeled 
NGPCR antibody or NGPCR ligand fusion protein. The solid phase support may then 
be washed with the buffer a second time to remove unbound antibody or fusion protein. 
The amount of bound label on solid support may then be detected by conventional 
means. ^ 

[098] By "solid phase support or earner" Is intended any support capable of 
binding an antigen or an antibody. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, gabbros, and magnetite. The nature of the carrier 
can be either soluble to some extent or insoluble for the purposes of the present 
invention. The support material can have virtually any possible structural configuration 
so long as the coupled molecule is capable of binding to an antigen or antibody. Thus, 
the support configuration may be spherical, as in a bead, or cylindrical, as in the inside 
surface of a test tube, or the external surface of a rod. Altematively, the surface may be 
flat such as a sheet, test strip, etc. Preferred supports include polystyrene beads. 
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Those skilled in the art will know many other suitable carriers for binding antibody or 
antigen, or will be able to ascertain the same by use of routine experimentation. 

[099] The binding activity of a given lot of NGPCR antibody or NGPCR ligand 
fusion protein may be determined according to well known methods. Those skilled in 
the art will be able to detemiine operative and optimal assay conditions for each 
determination by employing routine experimentation. 

[0100] With respect to antibodies, one of the ways in which the NGPCR 
antibody can be detectably labeled is by linking the same to an enzyme and use in an 
enzyme immunoassay (EIA) (Voller, A., "The Enzyme Linked Immunosorbent Assay 
(ELISA)". 1978. Diagnostic Horizons 2:1-7, Microbiological Associates Quarterly 
Publication, Walkersville. MD); Voller, A, et a!., 1978. J. Clin. Pathol. 31:507-520; Butler. 
J.E., 1981, Meth. Enzymol. 73:482-523; Maggio, E. (ed.), 1980, Enzyme Immunoassay, 
CRC Press, Boca Raton, FL,; Ishikawa, E. eta!., (eds.), 1981, Enzyme Immunoassay, 
Kgaku Shoin, Tokyo). The enzyme that is bound to the antibody will react with an 
appropriate substrate, preferably a chromogenic substrate, in isuch a manner as to 
produce a chemical moiety which can be detected, for example, by spectrophotometric, 
fluorimetric or by visual means. Enzymes which can be used to detectably label the 
antibody include, but are not limited to, malate dehydrogenase, staphylococcal 
nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha- 
glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish 
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase. 
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase 
and acetylcholinesterase. The detection can be accomplished by colorimetric methods 
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which employ a chromogenic substrate for the enzyme. Detection may also be 
accomplished by visual comparison of the extent of enzymatic reaction of a substrate in 
comparison with similarly prepared standards. 

[0101] Detection may also be accomplished using any of a variety of other 
immunoassays. For example, by radioactively labeling the antibodies or antibody 
fragments, it is possible to detect NGPCR through the use of a radioimmunoassay (RIA) 
(see, for example, Weintraub, B., Principles of Radioimmunoassays, Seventh Training 
Course on Radioligand Assay Techniques, The Endocrine Society, March, 1986, which 
is incorporated by reference herein). The radioactive isotope can be detected by such 
means as the use of a gamma counter or a scintillation counter or by autoradiography. 

[0102] It is also possible to label the antibody with a fluorescent compound. 
When the fluorescently labeled antibody is exposed to light of the proper wave length, 
its presence can then be detected due to fluorescence. Among the most commonly 
used fluorescent labeling compounds are fluorescein isothiocyanate, rhodamine, 
phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde and fluorescamine. 

[01 03] The antibody can also be detectably labeled using fluorescence emitting 
metals such as 152Eu, or others of the lanthanide series. These metals can be 
attached to the antibody using such metal chelating groups as 
diethylenetriaminepentacetic acid (DTPA) or ethylenediaminetetraacetic acid (EDTA). 

[0104] The antibody also can be detectably labeled by coupling it to a 
chemiluminescent compound. The presence of the chemiluminescent-tagged antibody 
is then determined by detecting the presence of luminescence that arises during the 
course of a chemical reaction. Examples of particularly useful chemiluminescent 



43 



wo 01/81411 



PCTAJSOl/13371 



labeling compounds are luminol, isoluminol, theromatic acridinium ester, imidazole, 
acridinium salt and oxalate ester. 

[0105] Likewise, a bioluminescent compound may be used to label the antibody 
of the present invention. Bioluminescence is a type of chemiluminescence found in 
biological systems in, which a catalytic protein increases the efficiency of the 
chemiluminescent reaction. The presence of a bioluminescent protein is determined by 
detecting the presence of luminescence. Important bioluminescent compounds for 
purposes of labeling are luciferin, luciferase and aequorin, 

SCREENING ASSAYS FOR COMPOUNDS THAT MODULATE NGPCR EXPRESSION 

OR ACTIVITY 

[0106] The following assays are designed to identify compounds that interact 
with {e.g., bind to) NGPCRs (including, but not limited to an ECD or CD of a NGPCR), 
compounds that interact with (e.g., bind to) intracellular proteins that interact with 
NGPCR (including but not limited to the TM and CD of NGPCR), compounds that 
interfere with the interaction of NGPCR with transmembrane or intracellular proteins 
involved in NGPCR-mediated signal transduction, and to compounds which modulate 
the activity of NGPCR gene (/.e., modulate the level of NGPCR gene expression) or 
modulate the level of NGPCR. Assays may additionally be utilized which identify 
compounds which bind to NGPCR gene regulatory sequences (e.g., promoter 
sequences) and which may modulate NGPCR gene expression. See e.g., Piatt, K.A., 
1994, J. Biol. Chem. 269:28558-28562, which is incorporated herein by reference in its 
entirety. 

[01 07] The compounds that can be screened in accordance with the invention 
include but are not limited to peptides, antibodies and fragments thereof, and other 
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organic compounds (e.g., peptidomimetics) that bind to an ECD of a NGPCR and either 
mimic the activity triggered by the natural llgand (i.e., agonists) or inhibit the activity 
triggered by the natural ligand (i.e., antagonists); as well as peptides, antibodies or 
fragments thereof, and other organic compounds that mimic the ECD of the NGPCR (or 
a portion thereof) and bind to and "neutralize" the natural ligand. 

[0108] Such compounds may include, but are not limited to, peptides such as, 
for example, soluble peptides, Including but not limited to members of random peptide 
libraries; (see, e.g.. Lam, K.S. et al., 1991,, Nature 354:82-84; Houghten, R. et a!.. 1991 , 
Nature 354:84-86), and combinatorial chemistry-derived molecular library made of D- 
and/or L- configuration amino acids, phosphopeptides (including, but not limited to 
members of random or partially degenerate, directed phosphopeptide libraries; see, 
e.g., Songyang, 2. et a!., 1993, Cell 72:767-778). antibodies (including, but not limited 
to, polyclonal, monoclonal, humanized, anti-idiotypic, chimeric or single chain 
antibodies, and FAb, F(ab')2 and FAb expression library fragments, and epitope-binding 
fragments thereof), and small organic or inorganic molecules. 

[0109] Other compounds which can be screened in accordance with the 
invention include but are not limited to small organic molecules that are able to cross 
the blood-brain barrier, gain entry into an appropriate cell (e.g., in the cerebellum, the 
hypothalamus, etc.) and affect the expression of a NGPCR gene or some other gene 
involved in the NGPCR signal transduction pathway (e.g.. by interacting with the 
regulatory region or transcription factors involved in gene expression); or such 
compounds that affect the activity of the NGPCR (e.g., by inhibiting or enhancing the 
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-^enzymatic activity of a CD) or the activity of some other Intracellular factor involved in 
the NGPCR signal transduction pathway. 

[0110] Computer modeling and searching technologies permit identification of 
compounds, or the improvement of already identified compounds, that can modulate 
NGPCR expression or activity. Having identified such a compound or composition, the 
active sites or regions are identified. Such active sites might typically be llgand binding 
sites. The active site can be identified using methods known in the art including, for 
example, from the amino acid sequences of peptides, from the nucleotide sequences of 
nucleic acids, or from study of complexes of the relevant compound or composition with 
its natural ligand. In the latter case, chemical or X-ray crystallographic methods can be 
used to find the active site by finding where on the factor the complexed ligand is found. 

[0111] Next, the three dimensional geometric structure of the active site Is 
determined. This can be done by known methods, including X-ray crystallography, 
which can determine a complete molecular structure. On the other hand, solid or liquid 
phase NMR can be used to determine certain intra-molecular distances. Any other 
experimental method of structure determination can be used to obtain partial or 
complete geometric structures. The geometric structures may be measured with a 
complexed ligand, natural or artificial, which may increase the accuracy of the active 
site structure determined. 

[01 12] If an incomplete or insufficiently accurate structure is determined, the 
methods of computer based numencal modeling can be used to complete the structure 
or improve its accuracy. Any recognized modeling method may be used, including 
parameterized models specific to particular biopolymers such as proteins or nucleic 
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acids, molecular dynamics models based on computing molecular motions, statistical 
mechanics models based on thennal ensembles, or combined models. For most types 
of models, standard molecular force fields, representing the forces between constituent 
atoms and groups, are necessary, and can be selected from force fields known in 
physical chemistry. The incomplete or less accurate experimental structures can serve 
as constraints on the complete and more accurate structures computed by these 
modeling methods. 

[01 1 3] Finally, having determined the structure of the active site, either 
experimentally, by modeling, or by a combination, candidate modulating compounds 
can be identified by searching databases containing compounds along with infonmation 
on their molecular structure. Such a search seeks compounds having structures that 
match the determined active site structure and that interact with the groups defining the 
active site. Such a search can be manual, but is preferably computer assisted. These 
compounds found from this search are potential NGPCR modulating compounds. 

[01 14] Altematively, these methods can be used to identify improved 
modulating compounds from an already known modulating compound or ligand. The 
composition of the known compound can be modified and the structural effects of 
modification can be determined using the experimental and computer modeling 
methods described above applied to the new composition. The altered structure is then 
compared to the active site structure of the compound to determine if an improved fit or 
interaction results. In this manner systematic variations in composition, such as by 
varying side groups, can be quickly evaluated to obtain modified modulating compounds 
or ligands of improved specificity or activity. 
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[01 15] Further experimental and computer modeling methods useful to identify 
modulating compounds based upon identification of the active sites of a NGPCR, and 
related transduction and transcription factors will be apparent to those of skill in the art. 

[0116] Examples of molecular modeling systems are the CHARMm and 
QUANTA programs (Polygen Corporation, Waltham, MA). CHARMm performs the 
energy minimization and molecular dynamics functions. QUANTA performs the 
construction, graphic modeling and analysis of molecular structure. QUANTA allows 
interactive construction, modification, visualization, and analysis of the behavior of 
molecules with each other. 

[Oil 7] A number of articles review computer modeling of drugs interactive with 
specific proteins, such as Rotivinen, et a!., 1988, Acta Pharmaceutical Fennica 97:159- 
166; Ripka, New Scientist 54-57 (June 16, 1988); McKinaly and Rossmann, 1989, 
Annu. Rev. Pharmacol. Toxiciol. 29:111-122; Peny and Davies, OSAR: Quantitative 
Structure-Activity Relationships in Drug Design pp. 189-193 (Alan R. Liss, Inc. 1989); 
Lewis and Dean. 1989 Proc. R. Soc, Lond. 236:125-140 and 141-162; and, with respect 
to a model receptor for nucleic acid components. Askew, et al., 1989, J. Am. Cham. 
Soc. 1 1 1:1 082-1 090. Other computer programs that screen and graphically depict 
chemicals are available from companies such as BioDesign, Inc. (Pasadena, CA.), 
Allelix, Inc. (Mississauga, Ontario, Canada), and Hypercube, Inc. (Cambridge, Ontario). 
Although these are primarily designed for application to drugs specific to particular 
proteins, they can be adapted to design of drugs specific to regions of DNA or RNA, 
once that region is identified. 
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[01 18] Although described above with reference to design and generation of 
compounds which could alter binding, one could also screen libraries of known 
connpounds, including natural products or synthetic chemicals, and biologically active 
materials, including proteins, for compounds which are inhibitors or activators. 

[01 19] Cell-based systems can also be used to identify compounds that bind 
NGPCRs as well as assess the altered activity associated with such binding in living 
cells. Assays for agonists and antagonists of NGPCRS that can be used in cell-based 
systems according to certain embodiments include, but are not limited to, those de- 
scribed in U.S. Patent Serial No. 6,004,808, and PCT Application Number US99/17425. 
which are herein incorporated by reference in their entirety for any purpose. 

[01 20] One tool of particular interest for such assays is green fluorescent 
protein which is described, inter alia, in U.S. Patent No. 5,625,048, herein incorporated 
by reference. Cells that may be used in such cellular assays include, but are not limited 
to, leukocytes, or cell lines derived from leukocytes, lymphocytes, stem cells, including 
embryonic stem cells, and the like. In addition, expression host cells (e.g., B95 cells, 
COS cells, CHO cells, OMK cells, fibroblasts, Sf9 cells) genetically engineered to 
express a functional NGPCR of interest and to respond to activation by the test, or 
natural, ligand, as measured by a chemical or phenotypic change, or induction of 
another host cell gene, can be used as an end point in the assay. 

[0121] Compounds identified via assays such as those described herein may be 
useful, for example, in elaborating certain biological functions of a NGPCR gene 
product. Such compounds can be administered to a patient at therapeutically effective 
doses to treat any of a variety of physiological or mental disorders. A therapeutically 
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effective dose refers to that amount of the compound sufficient to result in any 
amelioration, impediment, prevention, or alteration of any biological or overt symptom. 

[0122] Toxicity and therapeutic efficacy of such compounds can be determined 
by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., 
for determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the 
dose therapeutically effective in 50% of the population). The dose ratio between toxic 
and therapeutic effects is the therapeutic index and it can be expressed as the ratio 
LD50/ED50. Compounds which exhibit large therapeutic indices are prefen^ed. While 
compounds that exhibit toxic side effects may be used, typically care should be taken to 
design a delivery system that targets such compounds to the site of affected tissue in 
order to minimize potential damage to uninfected cells and, thereby, reduce side effects. 

[0123] The data obtained from the cell culture assays and animal studies can be 
used In fomiulating a range of dosage for use in humans. The dosage of such 
compounds lies preferably within a range of circulating concentrations that include the 
ED50 with little or no toxicity. The dosage may vary within this range depending upon 
the dosage form employed and the. route of administration utilized. For any compound 
used in the method of the invention, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve 
a circulating plasma concentration range that includes the IC50 (i.e., the concentration 
of the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine 
useful doses in humans. Levels in plasma may be measured, for example, by high 
perfonnance liquid chromatography. 



50 



wo 01/81411 



PCTAJSOl/13371 



[0124] Pharmaceutical compositions for use in accordance with the present 
invention may be formulated in conventional manner using one or more physiologically 
acceptable carriers or excipients. Thus, the compounds and their physiologically 
acceptable salts and solvates may be formulated for administration by inhalation or 
insufflation (either through the mouth or the nose) or oral, buccal, parenteral, 
Intracranial, intrathecal, or rectal administration. 

[0125] For oral administration, the pharmaceutical compositions may take the 
form of, for example, tablets or capsules prepared by conventional means with 
pharmaceutically acceptable excipients such as binding agents (e.g., pregelatinised 
maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., 
lactose, microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., 
magnesium stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch 
glycolate); or wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated 
by methods well known in the art. Liquid preparations for oral administration may take 
the fomri of, for example, solutions, syrups or suspensions, or they may be presented as 
a dry product for constitution with water or other suitable vehicle before use. Such liquid 
preparations may be prepared by conventional means with pliarmaceutically acceptable 
additives such as suspending agents (e.g., sori^itol syrup, cellulose derivatives or 
hydrogenated edible fats); emulsifying agents (e.g., lecithin or acacia); non-aqueous 
vehicles (e.g., almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and 
preservatives (e.g., methyl or propyl-p-hydroxybenzoates or sorbic acid). The 
preparations may also contain buffer salts, flavoring, coloring and sweetening agents as 
appropriate. 
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[01 26] Preparations for oral administration may be suitably formulated to give 
controlled release of the active compound. 

[01 27] For buccal administration the compositions may take the form of tablets 
or lozenges formulated in conventional manner. 

[0128] For administration by inhalation, the compounds for use according to the 
present invention are conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebulizer, with the use of a suitable propellant. 
e;g.. dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon 
dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may 
be determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of e.g. gelatin for use in an inhaler or insufflator may be formulated containing 
a powder mix of the compound and a suitable powder base such as lactose or starch. 

[0129] The compounds may be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for injection may 
be presented in unit dosage form, e.g., in ampoules or in multi-dose containers, with an 
added preservative. The compositions may take such forms as suspensions, solutions 
or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as 
suspending, stabilizing and/or dispersing agents. Alternatively, the active ingredient 
may be in powder forni for constitution with a suitable vehicle, e.g., sterile pyrogen-free 
water, before use. 

[0130] The compounds may also be formulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository bases 
such as cocoa butter or other glycerides. 
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[0131] In addition to the formulations described previously, the compounds may 
also be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramusculariy) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with 
suitable polymeric or hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, 
as a sparingly soluble salt. 

[0132] The compositions may, if desired, be presented in a pack or dispense!^ 
device which may contain one or more unit dosage forms containing the active 
ingredient. The pack may for example comprise metal or plastic foil, such as a blister 
pack. The pack or dispenser device may be accompanied by instructions for 
administration. 

IN VITRO SCREENING ASSAYS FOR COMPOUNDS THAT BIND TO NGPCRs 
[0133] in vitro systems may be designed to identify compounds capable of 

interacting with (e.g., binding to) NGPCR (including, but not limited to, a ECD or CD of 

NGPCR). Compounds identified may be useful, for example, in modulating the activity 

of wild type and/or mutant NGPCR gene products; may be useful in elaborating certain 

biological functions of the NGPCR; may be utilized in screens for Identifying compounds 

that disrupt normal NGPCR Interactions; or may In themselves disrupt such interactions, 

[0134] The principle of the assays used to identify compounds that bind to the 

NGPCR Involves preparing a reaction mixture of the NGPCR and the test compound 

under conditions and for a time sufficient to allow the two components to interact and 

bind, thus forming a complex which can be removed and/or detected in the reaction 

mixture. The NGPCR species used can vary depending upon the goal of the screening 
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assay. For example, in certain embodiments, where agonists of the natural ligand are 
sought, the full length NGPCR, or a soluble truncated NGPCR, e.g., in which the TM 
and/or CD is deleted from the molecule, a peptide corresponding to'a ECD or a fusion 
protein containing one or more NGPCR ECD fused to a protein or polypeptide that 
affords advantages in the assay system (e.g., labeling, isolation of the resulting 
complex, etc.) can be utilized. Where compounds that interact with the cytoplasmic 
domain are sought to be identified, peptides corresponding to the NGPCR CD and 
fusion proteins containing the NGPCR CD can be used. 

[01 35] The screening assays can be conducted in a variety of ways. For 
example, one method to conduct such an assay would involve anchoring the NGPCR 
protein, polypeptide, peptide or fusion protein or the test substance onto a solid phase 
and detecting NGPCR/test compound complexes anchored on the solid phase at the 
end of the reaction. In certain embodiments of such a method, the NGPCR reactant 
may be anchored onto a solid surface, and the test compound, which is not anchored, 
may be labeled, either directly or indirectly, 

[01 36] In practice, microtiter plates may conveniently be utilized as the solid 
phase. The anchored component may be immobilized by non-covalent or covalent 
attachments. Non-covalent attachment may be accomplished by simply coating the 
solid surface with a solution of the protein and drying. Alternatively, an immobilized 
antibody, preferably a monoclonal antibody, specific for the protein to be immobilized 
may be used to anchor the protein to the solid surface. The surfaces may be prepared 
in advance and stored. 
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[0137] In order to conduct the assay, the nonimmpbilized component is- added 
to the coated surface containing the anchored component. After the reaction is 
complete, unreacted components are removed (e.g., by washing) under conditions such 
that any complexes formed will remain immobilized on the solid surface. The detection 
of complexes anchored on the solid surface can be accomplished in a number of ways. 
Where the previously nonimmobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the 
previously nonimmobilized component is not pre-labeled, an indirect label can be used 
to detect complexes anchored on the surface; e.g.. using a labeled antibody specific for 
the previously nonimmobilized component (the antibody, in turn, may be directly labeled 
or indirectly labeled with a labeled anti-lg antibody). 

[0138] Alternatively, a reaction can be conducted in a liquid phase, the reaction 
products separated from unreacted components, and complexes detected; e.g., using 
an immobilized antibody specific for a NGPCR protein, polypeptide, peptide or fusion 
protein or the test compound to anchor any complexes formed in solution, and a labeled 
antibody specific for the other component of the possible complex to detect anchored 
complexes. 

[0139] Altematively, cell-based assays can be used to identify compounds that 
interact with NGPCR. To this end, cell lines that express NGPCR, or cell lines (e.g., 
COS cells, CHO cells, fibroblasts, etc.) that have been genetically engineered to 
express a NGPCR (e.g., by transfection or transduction of NGPCR DNA) can be used, 
Interaction of the test compound with, for example, a ECD of a NGPCR expressed by 
the host cell can be determined by comparison or competition with native ligand. 
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5.5.2. ASSAYS FOR INTRACELLULAR PROTEINS THAT INTERACT WITH NGPCRs 

[0140] Any method suitable for detecting protein-protein interactions may be 
employed for identifying transmembrane proteins or intracellular proteins that interact 
with a NGRCR. Among the traditional methods which may be employed are 
co-immunoprecipitation, crosslinking and co-purification through gradients or 
chromatographic columns of cell lysates or proteins obtained from cell lysates and a 
NGPCR to identify proteins In the lysate that interact with the NGPCR. For these 
assays, the NGPCR component used can be a full length NGPCR, a soluble derivative 
lacking the membrane-anchoring region (e.g., a truncated NGPCR in which a TM is 
deleted resulting in a truncated molecule containing a ECD fused to a CD), a peptide 
corresponding to a CD or a fusion protein containing a CD of a NGPCR, Once isolated, 
such an intracellular protein can be identified and can, in turn, be used, in conjunction 
with standard techniques, to identify proteins with which It interacts. For example, at 
least a portion of the amino acid sequence of an intracellular protein which interacts with 
a NGPCR can be ascertained using techniques well known to those of skill in the art, 
such as via the Edman degradation technique. (See, e.g., Creighton, 1983, "Proteins: 
Structures and Molecular Principles", W.H. Freeman & Co., N.Y., pp.34-49). The amino 
acid sequence obtained may be used as a guide for the generation of oligonucleotide , 
mixtures that can be used to screen for gene sequences encoding such intracellular 
proteins. Screening can be accomplished, for example, by standard hybridization or 
PCR techniques. Techniques for the generation of oligonucleotide mixtures and the 
screening are well-known. (See, e.g., Ausubel, supra, and PCR Protocols: A Guide to 
Methods and Applications, 1990, Innis, M. etaL, eds. Academic Press, Inc., New York). 
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[0141] Additionally, methods may be employed which result in the simultaneous 
identification of genes which encode the transmembrane or intracellular proteins 
interacting with NGPCR. These methods include, for example, probing expression, 
libraries, in a manner similar to the well known technique of antibody probing of Agt1 1 
libraries, using labeled NGPCR protiein, or an NGPCR polypeptide, peptide or fusion 
protein, e.g., an NGPCR polypeptide or NGPCR domain fused to a marker (e.g., an 
enzyme, fluor, luminescent protein, or dye), or an Ig-Fc domain. 

[0142] One method that detects protein interactions in vivo, the two-hybrid 
system, is described In detail for illustration only and not by way of limitation. One 
version of this system has been described (Chien et al., 1991, Proc. Natl. Acad. Sci. 
USA, 88:9578-9582) and is commercially available from Clontech (Palo Alto, OA). 

[0143] Briefly, utilizing such a system, plasmids are constructed that encode two 
hybrid proteins: one plasmid has nucleotides encoding the DNA-binding domain of a 
transcription activator protein fused to a NGPCR nucleotide sequence encoding 
NGPCR, an NGPCR polypeptide, peptide or fusion protein, and the other plasmid has 
nucleotides encoding the transcription activator protein's activation domain fused to a 
cDNA encoding an unknown protein which has been recombined into this plasmid as 
part of a cDNA library. The DNA-binding domain fusion plasmid and the cDNA library 
are transfomned into a strain of the yeast Saccharomyces cerevisiae that contains a 
reporter gene (e.g., HBS or lacZ) whose regulatory region contains the transcription 
activator's binding site. Either hybrid protein alone cannot activate transcription of the 
reporter gene: the DNA-binding domain hybrid cannot because it does not provide 
activation function and the activation domain hybrid cannot because it cannot localize to 
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the activator's binding sites. Interaction of the two hybrid proteins reconstitutes the 
functional activator protein and results in expression of the reporter gene, which is 
detected by an assay for the reporter gene product. 

[0144] The two-hybrid system or related methodology may be used to screen 
activation domain libraries for proteins that interact with the "bair gene product. By way 
of example, and not by way of limitation, a NGPCR may be used as the bait gene 
product. Total genomic or cDNA sequences are fused to the DNA encoding an 
activation domain. This library and a plasmid encoding a hybrid of a bait NGPCR gene 
product fused to the DNA-binding domain are cotransformed into a yeast reporter strain, 
and the resulting transformants are screened for those that express the reporter gene. 
For example, and not by way of limitation, a bait NGPCR gene sequence, such as the 
open reading frame of a NGPCR (or a domain of a NGPCR) can be cloned into a vector 
such that it is translationally fused to the DNA encoding the DNA-binding domain of the 
GAL4 protein. These colonies are purified and the library plasmids responsible for 
reporter gene expression are isolated. DNA sequencing is then used to identify the 
proteins encoded by the library plasmids. 

[0145] A cDNA library of the cell line from which proteins that interact with bait 
NGPCR gene product are to be detected can be made using methods routinely 
practiced in the art. According to certain embodiments of the system described herein, 
for example, the cDNA fragments can be inserted into a vector such that they are 
translationally fused to the transcriptional activation domain of GAL4. This library can 
be co-transfonned along with the bait NGPCR gene-GAL4 fusion plasmid into a yeast 
strain which contains a lacZ gene driven by a promoter which contains GAL4 activation 
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sequence. A cDNA encoded protein, fused to GAL4 transcriptional activation domain, 
that interacts with bait NGPCR gene product will reconstitute an active GAL4 protein 
and thereby drive expression of the HISS gene. Colonies which express HISS can be 
detected by their growth on petri dishes containing semi-solid agar based media lacking 
histidine. The cDNA can then be purified from these strains, and used to produce and 
isolate the bait NGPCR gene-interacting protein using techniques routinely practiced in 
the art. 

ASSAYS FOR COMPOUNDS THAT INTERFERE WITH NGPCR/INTRACELLULAR 
OR NGPCR/TRANSMEMBRANE MACROMOLECULE INTERACTION 

[0146] The macromolecules that interact with the NGPCR are referred to, for 
purposes of this discussion, as "binding partners." These binding partners are likely to 
be involved in the NGPCR signal transduction pathway. Therefore, it is desirable to 
identify compounds that interfere with or dismpt the interaction of such binding partners 
which may be useful in regulating the activity of a NGPCR and controlling disorders 
associated with NGPCR activity. For example, given their expression pattern, the 
described NGPCRs are contemplated to be particularly useful in methods for identifying 
compounds useful in the therapeutic treatment of obesity, inflammation, immune 
disorders, diabetes, heart and coronary disease, metabolic disorders, and cancer. 

[0147] The basic principle of the assay systems used to identify compounds that 
interfere with the interaction between a NGPCR and its binding partner or partners 
involves preparing a reaction mixture containing NGPCR protein, polypeptide, peptide 
or fusion protein as described in Sections 5.5.1 and 5.5.2 above, and the binding 
partner under conditions and for a time sufficient to allow the two to interact and bind, 
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thus forming a complex. In order to test a compound for inhibitory activity, the reaction 
mixture is prepared in the presence and absence of the test compound. The test 
compound may be initially included in the reaction mixture, or may be added at a time 
subsequent to the addition of the NGPCR moiety and its binding partner. Control 
reaction mixtures are incubated without the test compound or with a placebo. The 
formation of any complexes between the NGPCR moiety and the binding partner is then 
detected. The formation of a complex in the control reaction, but not in the reaction 
mixture containing the test compound, indicates that the compound interferes with the 
interaction of the NGPCR and the interactive binding partner. Additionally, complex 
formation within reaction mixtures containing the test compound and normal NGPCR 
protein may also be compared to complex formation within reaction mixtures containing 
the test compound and a mutant NGPCR. This comparison may be important in those 
cases wherein it is desirable to identify compounds that specifically disrupt interactions 
of mutant, or mutated, NGPCRs but not normal NGPCRs. 

[0148] According to certain embodiments, the assay for compounds that 
interfere with the interaction of a NGPCR and its binding partners can be conducted in a 
heterogeneous or homogeneous fomiat. Heterogeneous assays involve anchoring 
either the NGPCR moiety product or the binding partner onto a solid phase and 
detecting complexes anchored on the solid phase at the end of the reaction. In 
homogeneous assays, the entire reaction is carried out in a liquid phase. In either 
approach, the order of addition of reactants can be varied to obtain different information 
about the compounds being tested. For example, test compounds that interfere with the 
interaction by competition can be identified by conducting the reaction in the presence 
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of the test substance; i.e., by adding the test substance to the reaction mixture prior to. 
or simultaneously with, a NGPCR moiety and interactive binding partner. Alternatively, 
test compounds that disrupt prefomried complexes, e.g. compounds with higher binding 
constants that displace one of the components from the complex, can be tested by 
adding the test compound to the reaction mixture after complexes have been formed. 
Various formats, according to certain embodiments, are described briefly below. 

[0149] In a heterogeneous assay system, either a NGPCR moiety or an 
interactive binding partner, is anchored onto a solid surface, while the non-anchored 
species is labeled, either directly or indirectly. In practice, microtiter plates are 
conveniently utilized. The anchored species may be immobilized by non-covalent or 
covalent attachments. Non-covalent attachment may be accomplished simply by 
coating the solid surface with a solution of the NGPCR gene product or binding partner 
and drying. Altematively, an immobilized antibody specific for the species to be 
anchored may be used to anchor the species to the solid surface. The surfaces may be 
prepared in advance and stored, 

[01 50] In order to conduct the assay, the partner of the Immobilized species is 
exposed to the coated surface with or without the test compound. After the reaction is 
complete, unreacted components are removed (e.g., by washing) and any complexes 
formed will remain immobilized on the solid surface. The detection of complexes 
anchored on the solid surface can be accomplished in a number of ways. Where the 
non-immobilized species is pre-labeled, the detection of label immobilized on the 
surface indicates that complexes were formed. Where the non-immobilized species is 
not pre-labeled, an indirect label can be used to detect complexes anchored on the 
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surface; e.g., using a labeled antibody specific for the initially non-immobilized species 
(the antibody, in turn, may be directly labeled or indirectly labeled with a labeled anti-lg 
antibody). Depending upon the order of addition of reaction components, test 
compounds which inhibit complex formation or which disrupt preformed complexes can 
be detected. 

[0151] Alternatively, the reaction can be conducted in a liquid phase in the 
presence or absence of the test compound, the reaction products separated from 
unreacted components, and complexes detected; e.g., using an immobilized antibody 
specific for one of the binding components to anchor any complexes formed in solution, 
and a labeled antibody specific for the other partner to detect anchored complexes. 
Again, depending upon the order of addition of reactants to the liquid phase, test 
compounds which inhibit complex or which disrupt preformed complexes can be 
identified. 

[0152] In certain embodiments of the invention, a homogeneous assay can be 
used in which a preformed complex of a NGPCR moiety and an interactive binding 
partner is prepared in which either the NGPCR or its binding partner is labeled, but the 
signal generated by the label is quenched due to formation of the complex (see, e.g., 
U.S. Patent No. 4,109,496 by Rubenstein which utilizes this approach for 
immunoassays). The addition of a test substance that competes with and displaces one 
of the species from the preformed complex will result in the generation of a signal above 
background. In this way, test substances which disrupt NGPCR/intracellular binding 
partner interaction can be identified. 
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[0153] In certain embodiments, a NGPCR fusion can be prepared for 
immobilization. For example, a NGPCR or a peptide fragment, e.g., corresponding to a 
CD. can be fused to a glutathlone-S-transferase (GST) gene using a fusion vector, such 
as pGEX-5X-1 , in such a manner that its binding activity is maintained in the resulting 
fusion protein. The interactive binding partner can be purified and used to raise a 
monoclonal antibody, using methods routinely practiced in the art and described above, 
in Section 5.3. This antibody can be labeled with the radioactive isotope 1251, for 
example, by methods routinely practiced in the art. In a heterogeneous assay, e.g., the 
GST-NGPCR fusion protein can be anchored to glutathione-agarose beads. The 
Interactive binding partner can then be added in the presence or absence of the test 
compound in a manner that allows interaction and binding to occur. At the end of the 
reaction period, unbound material can be washed away, and the labeled monoclonal 
antibody can be added to the system and allowed to bind to the complexed 
components. The interaction between a NGPCR gene product and the interactive 
binding partner can be detected by measuring the amount of radioactivity that remains 
associated with the glutathione-agarose beads. A successful Inhibition of the interaction 
by the test compound will result in a decrease in measured radioactivity. 

[0154] Alternatively, the GST-NGPCR fusion protein and the interactive binding 
partner can be mixed together in liquid in the absence of the solid glutathione-agarose 
beads. The test compound can be added either during or after the species are allowed 
to interact. This mixture can then be added to the glutathione-agarose beads and 
unbound material is washed away. Again the extent of inhibition of the NGPCR/binding 
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partner interaction can be detected by adding the labeled antibody and measuring the 
radioactivity associated with the beads. 

[0155] In certain embodiments of the invention, these same techniques can be 
employed using peptide fragments that con^espond to the binding domains of a NGPCR 
and/or the interactive or binding partner (in cases where the binding partner is a 
protein), in place of one or both of the full length proteins. Any number of methods 
routinely practiced in the art can be used to identify and isolate the binding sites. These 
methods include, but are not limited to, mutagenesis of the gene encoding one of the 
proteins and screening for disruption of binding in a co-immunoprecipitation assay. 
Compensatory mutations in the gene encoding the second species in the complex can 
then be selected. Sequence analysis of the genes encoding the respective proteins will 
reveal the mutations that correspond to the region of the protein involved in interactive 
binding. Alternatively, one protein can be anchored to a solid surface using methods 
described above, and allowed to interact with and bind to its labeled binding partner, 
which has been treated with a proteolytic enzyme, such as trypsin. After washing, a 
relatively short, labeled peptide comprising the binding domain may remain associated 
with the solid material, which can be isolated and identified by amino acid sequencing. 
Also, once the gene coding for the intracellular binding partner is obtained, short gene 
segments can be engineered to express peptide fragments of the protein, which can 
then be tested for binding activity and purified or synthesized. 

[01 56] For example, and not by way of limitation, a NGPCR gene product can 
be anchored to a solid material as described, above, by making a GST-NGPCR fusion 
protein and allowing it to bind to glutathione agarose beads. The interactive binding 

64 



1 



wo 01/81411 PCT/USOl/13371 

partner can be labeled with a radioactive isotope, such as 35S, and cleaved with a 
proteolytic enzyme such as trypsin. Cleavage products can then be added to the 
anchored GST-NGPCR fusion protein and allowed to bind. After washing away 
unbound peptides, labeled bound material, representing the intracellular binding partner 
binding domain, can be eluted, purified, and analyzed for amino acid sequence by well- 
known methods. Peptides so identified can be produced synthetically or fused to 
appropriate facilitative proteins using recombinant DNA technology. 

[0157] The present invention is not to be limited In scope by the specific 
embodiments described herein, which are intended as illustrations of individual aspects 
of certain embodiments of the invention, and functionally equivalent methods and 
components are within the scope of the invention. Indeed, various modifications of the 
invention, in addition to those shown and described herein will become apparent to 
those skilled in the art from the foregoing description and accompanying drawings. 
Such modifications are intended to fall within the scope of the appended claims. All 
referenced publications, patents, and patent applications are incorporated by reference 
herein for any purpose. 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from: 

a) the nucleotide sequence as set forth in SEQ ID NO: 1 ; 

b) the nucleotide sequence as set forth in SEQ ID NO: 5; 

c) the nucleotide sequence as set forth in SEQ ID NO: 7; 

d) the nucleotide sequence as set forth in SEQ ID NO: 11; and 

e) the nucleotide sequence as set forth in SEQ ID NO: 19. 

2. The isolated nucleic acid molecule of claim 1 , comprising a nucleotide sequence 
as set forth in SEQ ID NO: 5. 

3. An isolated nucleic acid molecule comprising at least 300 contiguous bases of a 
nucleotide sequence selected from: 

a) the nucleotide sequence as set forth in SEQ ID NO: 1; 

b) the nucleotide sequence as set forth in SEQ ID NO: 5; 

c) the nucleotide sequence as set forth in SEQ ID NO: 7; 

d) the nucleotide sequence as set forth In SEQ ID NO: 11; and 

e) the nucleotide sequence as set forth in SEQ ID NO: 19. 

4. An isolated nucleic acid molecule comprising a nucleotide sequence encoding 
the amino acid sequence as set forth in SEQ ID NO: 6. 



66 



wo 01/81411 



PCT/USOl/13371 



5. An isolated nucleic acid molecule comprising a nucleotide sequence encoding 
the amino acid sequence as set forth in SEQ ID NO: 2. 

6. An isolated nucleic acid molecule comprising a nucleotide sequence encoding 
the amino acid sequence as set forth in SEQ ID NO: 8. 

7. An isolated nucleic acid molecule comprising a nucleotide sequence encoding 
the amino acid sequence as set forth in SEQ ID NO; 12, 

8. An isolated polypeptide comprising an amino acid sequence selected from: 

a) the amino acid sequence as set forth in SEQ ID NO: 6; 

b) the amino acid sequence as set forth in SEQ ID NO: 2; 

c) the amino acid sequence as set forth in SEQ ID NO: 8; and 

d) the amino acid sequence as set forth in SEQ ID NO: 12. 

9. An isolated polypeptide of claim 8, comprising an amino acid sequence as set 
forth in SEQ ID NO: 6. 

10. An isolated polypeptide comprising an amino acid sequence selected from: 

a) the amino acid sequence as set forth in SEQ ID NO: 6 with at least 
one conservative amino acid substitution; 

b) the amino acid sequence as set forth in SEQ ID NO: 2 with at least 
one conservative amino acid substitution; 

c) the amino acid sequence as set forth in SEQ ID NO: 8 with at least 
one conservative amino acid substitution; and 
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d) the amino acid sequence as set forth in SEQ ID NO: 12 with at 
least one conservative amino acid substitution. 

11. An isolated polypeptide of claim 10, comprising an amino acid sequence as set 
forth in SEQ ID NO: 6 with at least one conservative amino acid substitution. 

12. A recombinant host cell containing an isolated nucleic acid molecule of any of 
claims 1, 2, 3, 4, 5, 6, or 7. 

13. A recombinant host cell of claim 12, wherein the host cell is eukaryotic. 

14. A recombinant host cell of claim 13, wherein the host cell is selected fnDm OHO, 
VERO, BHK, HeLa. COS, MDCK, 293, 3T3. and WI38 cell lines. 
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SEQUENCE LISTING 

<110> Walke, D. Wade 
Wang, Xiaoming 
Scoville, , John 

<120> Novel Human Membrane Proteins and Polynucleotides Encoding the Same 

<130> 07705.0014-00000 

<140> Not yet assigned 
<141> April 27, 2001 

<150> US 60/199,950 
<151> 2000-04-27 



<160> 19 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 2310 

<212> DNA - 

<213> homo sapiens • 



<400> 1 

atgaacggac 

attcggcgaa 

caacgagttt 

gaagatggag 

agtaggatac 

gagttctgca 

aaaggactga 

tttgccagaa 

ccaaatgcgg 

gaattacaaa 

gtaaaggatg 

gatgccaata 

ttcaacactt 

caactcctag 

gcccttacaa 

gtggtggaac 

ccttcaaatg 

acatttatac 

ttgcttaata 

aagcttttcc 

agcaaaactg 

aagtacaacc 

gcgaaggact 

tgccgctgca 

tatcccaaat 

gctctcacag 

gttttggtca 

gaaaactcca 

aataatgaca 

gccgccttac 

cagctctatt 

atctcattaa 

tattctcaga 

tggctggcaa 

gtacctgtaa 



agttggatct 
ttcctattca 
tcagaggaaa 
atcttataac 
tgaaactgac 
ggaatggtgg 
gatgtacaat 
tcccagtggg 
gcaatccaat 
aagtgacaat 
tcacagcacc 
aattaactgc 
ccagaaatgc 
atgccagtga 
cgcttattga 
ctaacatagc 
ttcgcttctc 
atacaaatgt 
tgacgaaaaa 
aatcaaaaac 
atgaaaatga 
aaaaagaatt 
gggacacata 
accatactac 
cacttgacat 
ttatatttca 
atctgtgcat 
ataagaactt 
tacccaggac 
tgcactattt 
accttctaat 
ttggatgggg 
atggaaataa 
ttccagaacc 
ccattatcct 



aagtgggaag 
taatgaagat 
acttctgagt 
aatttttgat 
attatttgga 
aacctgggaa 
tgctaatttt 
cagatatgga 
ggcagtccgg 
aggaaattgc 
acttaataac 
tgagaacatc 
ttcacctgag 
agatgctttt 
gcaaatggag 
aatacagtca 
tgtgcagaaa 
ggatggcctt 
ttacaccaag 
ttttacagct 
gcaagatcag 
tcaactctat 
tggctgtcaa 
taattttgct 
attatccaac 
gattgtcacc 
atcaatgttg 
gcagacaagt 
agacaccatt 
tctgttagtg 
aaggaccatg 
agtcccagct 
tccacagtgg 
caatggtgtt 
catcagcaat 



ctaatcatca 
attacttatg 
aatgatgaag 
agttctgacc 
aaatctactt 
aatggcagat 
tgtgaaaata 
ccatccttgc 
ttgtgcagtc 
aatgaaaatc 
atttcttctg 
actagtgcta 
gcaaagaaag 
caaagagttg 
acttattcct 
gcaaatttct 
ggagctagca 
aacccagatg 
acatgcggct 
aaatcggatt 
agtgcttctg 
tcctatgcct 
aaagacaagg 
gtattaatga 
gttggatgtg 
aggaaagtca 
attttcaacc 
gatggtgaca 
aacatcccga 
acatttacct 
aagcctcttc 
atagtagtgg 
gaattagact 
ataaaaagtc 
gttgttatgt 



aagctcaact 
atgaattagt 
taacaataaa 
tttcctttgc 
cctcatcaag 
gtatttgtac 
gtacctatat 
aaacatgtgg 
tctctctata 
tggaaaccct 
aagtccagat 
cgcgagtggt 
ttgccatagt 
ctgctactgc 
tgtctttggg 
cttcagaaaa 
gttctctagt 
cacagactga 
ttgtagttta 
ttagtcaaaa 
ttgacatggt 
gtgtctattg 
gcactgatgg 
ctttcaaaaa 
cactgtctgt 
gaaaaacctc 
tcctctttgt 
tcaataatat 
atcccatgtg 
ggaacgcact 
ctcggcattt 
ctataacagt 
accggcaaga 
cgctgttgtg 
ttattacaat 



tggggaggat 
gctaatgatg 
gtataaagat 
aattcagtgc 
cacccctaca 
agaagagtgg 
gggttttact 
caaggatact 
tggagagata 
ggaaaagcag 
tttaacatct 
tggacagata 
aacagtgagt 
taatgatgat 
taatcaatca 
tgcggtgggg 
ttctagttca 
gcttcaggtc 
tcaaaatgac 
aattatctca 
ctttagtcca 
gaatttgtca 
attcctgcgc 
ggattatcaa 
tactggtctg 
agtaacctgg 
gtttggaatt 
tgactttgac 
cactgcgatt 
cagcgctgca 
cattcttttc 
gggagttatt 
gaaaatctgc 
gtcattcatc 
ctcgatcaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080- 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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gtgctgtgga agaataacca gaacctgaca agcacaaaaa aagtttcatc catgaagaag 2160 

attgttagca cattatctgt tgcagttgtt tttggaatta cctggattct agcatacctg 2220 

atgctagtta atgatgatag catcaggatc gtcttcagct acatattctg ccttttcaac 2280 

actacacagg tatggtgcgg attagtctga 2310 

<210> 2 
<211> 769 
<212> PRT 

<213> homo sapiens 
<400> 2 

Met Asn Gly Gin Leu Asp Leu Ser Gly Lys Leu lie He Lys Ala Gin 

15 10 15 

Leu Gly Glu Asp He Arg Arg He Pro He His Asn Glu Asp He Thr 

20 25 30 ' 

Tyr Asp Glu Leu Val Leu Met Met Gin Arg Val Phe Arg Gly Lys Leu 

35 40 45 

Leu Ser Asn Asp Glu Val Thr He Lys Tyr Lys Asp Glu Asp Gly Asp 

50 55 60 

Leu He Thr He Phe -Asp Ser Ser Asp Leu Ser Phe Ala He Gin Cys 
65 70 75 80 

Ser Arg He Leu Lys Leu Thr Leu Phe Gly Lys Ser Thr Ser Ser Ser 

85 90 95 

Ser Thr Pro Thr Glu Phe Cys Arg Asn Gly Gly Thr Trp Glu Asn Gly 

100 105 . 110 

Arg Cys He Cys Thr Glu Glu Trp Lys Gly Leu Arg Cys Thr He Ala 

115 120 125 

Asn Phe Cys Glu Asn Ser Thr Tyr Met Gly Phe Thr Phe Ala Arg He 

130 135 140 

Pro Val Gly Arg Tyr Gly Pro Ser Leu Gin Thr Cys Gly Lys Asp Thr 
145 150 155 160 

Pro Asn Ala Gly Asn Pro Met Ala Val Arg Leu Cys Ser Leu Ser Leu 

165 - 170 175 

Tyr Gly Glu He Glu Leu Gin Lys Val Thr He Gly Asn Cys Asn Glu 

180 185 190 

Asn Leu Glu Thr Leu Glu Lys Gin Val Lys Asp Val Thr Ala Pro Leu 

195 200 , 205 

Asn Asn He Ser Ser Glu Val Gin He Leu Thr Ser Asp Ala Asn Lys 

210 215 220 

Leu Thr Ala Glu Asn He Thr Ser Ala Thr Arg Val Val Gly Gin He 
225 230 235 ' 240 

Phe Asn Thr Ser Arg Asn Ala Ser Pro Glu Ala Lys Lys Val Ala He 

245 250 255 

Val Thr Val Ser Gin Leu Leu Asp Ala Ser Glu Asp Ala Phe Gin Arg 

260 265 270 

Val Ala Ala Thr Ala Asn Asp Asp Ala Leu Thr Thr Leu He Glu Gin 

275 280 285 

Met Glu Thr Tyr Ser Leu Ser Leu Gly Asn Gin Ser Val Val Glu Pro 

290 295 300 

Asn He Ala He Gin Ser Ala Asn Phe Ser Ser Glu Asn Ala Val Gly 
305 310 315 320 

Pro Ser Asn Val Arg Phe Ser Val Gin Lys Gly Ala Ser Ser Ser Leu 

325 330 335 

Val Ser Ser Ser Thr Phe He His Thr Asn Val Asp Gly Leu Asn Pro 

340 345 350 

Asp Ala Gin Thr Glu Leu Gin Val Leu Leu Asn Met Thr- Lys Asn Tyr 

355 360 365 

Thr Lys Thr Cys Gly Phe Val Val Tyr Gin Asn Asp Lys Leu Phe Gin 

370 375 380 

Ser Lys Thr Phe Thr Ala Lys Ser Asp Phe Ser Gin Lys He He Ser 
385 390 395 400 

Ser Lys Thr Asp Glu Asn Glu Gin Asp Gin Ser Ala Ser Val Asp Met 
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405 



Val 


Phe 


Ser 


Pro 


Lys 


Tyr Asn Gin 








420 








Ala 


Cys 


Val 


Tyr 


Trp Asn Leu Ser 






435 








440 


Cys 


Gin 


Lys Asp 


Lys 


Gly Thr Asp 




450 










455 


His 


Thr 


Thr 


Asn 


Phe 


Ala 


Val Leu 


465 










470 




Tyr 


Pro 


Lys 


Ser 


Leu Asp 


He Leu 










485 






Val 


Thr 


Gly Leu 


Ala 


Leu 


Thr Val 








500 








Val Arg 


Lys 


Thr 


Ser Val Thr Trp 






515 








520 


Met 


Leu 


He 


Phe 


Asn 


Leu 


Leu Phe 




530 










535 


Lys Asn 


Leu 


Gin 


Thr 


Ser Asp Gly 


545 










550 




Asn 


Asn 


Asp 


He 


Pro Arg Thr Asp 










565 






Cys 


Thr 


Ala 


He 


Ala 


Ala 


Leu Leu 








580 








Thr 


Trp 


Asn 


Ala 


Leu 


Ser 


Ala Ala 






595 








600 


Thr 


Met 


Lys 


Pro 


Leu 


Pro Arg His 




610 










615 


Gly Trp 


Gly Val 


Pro 


Ala 


He Val 


625 










630 




Tyr 


Ser 


Gin 


Asn 


Gly Asn Asn Pro 










645 






Glu 


Lys 


He 


Cys 


Trp 


Leu 


Ala He 








660 








Ser 


Pro 


Leu 


Leu 


Trp 


Ser 


Phe He 






675 








680 


Ser 


Asn 


Val 


Val 


Met 


Phe 


He Thr 




690 










695 


Asn 


Asn 


Gin 


Asn 


Leu 


Thr 


Ser Thr 


705 










710 




He 


Val 


Ser 


Thr 


Leu 


Ser 


Val Ala 










725 






Leu 


Ala 


Tyr 


Leu 


Met 


Leu 


Val Asn 








740 








Ser 


Tyr 


He 


Phe 


Cys 


Leu 


Phe Asn 






755 








760 


Val 

















410 




415 

X iJ 




Lys 


Glu 


Phe Gin Leu Tyr 


Ser 


Tvr 


425 




430 






Ala 


Lys 


AsD Tr"D AsD Thr 


Tvr 


Gly 






445 






Gly Phe 


IjF»n Ar'fT Arcr 


Cys 


Asn 






460 








Thr* 

X nr 


PHp TtV^ Ti\7«i A<?"ri 


lyr 








475 




*a O U 


Ser 


Asn 


V cij_ v^^o xi-L d 








AQCi 




*3 -7 J 




±±e 


CLlB 


Gin Tie* Val Thr 




uy o 












val 


Leu 


V dJ. xaoil XJmi. \^yo 


T1 e> 
XXc 


OCX 












Val 


Pne 


v3j.y xj.e oxu /^sn 


C <-s >- 
OCX 


Asn 












Asp 


ixe 


rioii rioii ±-Lc? rlo^ 




A cr^ 












Thr 


He 


j\STi xxe irro ash 


Pro 


rcieu 




c *i n 

570 




R7 R 




His 


Tyr 


irne iieu iieu vax 


inr 


rne 


585 










Gin 


Leu 


'Pt/t" T't/t* T.aii T.011 

1 y X 1 y X u. Jut? ll 




AX ^ 












Phe 


He 


Lieu rne ixe oer 


Leu 


x±e 












Val 


Ala 


i±& inr va± v3±y 


va± 


xxe 






"5 R 




D4 u 


Gin 


Trp 


O Til T ^1 ^ TV 

oxu jjsu risp J-yr 


Arg 


oxn 




650 




DOO 




Pro 


Glu 


CJLKJ rion oXy Va.X 


T1 o 

xxe 


Lys 


665 




D / U 






Val 


Pro 


Val THv Tlfii Tlo 

vax iiir JLxe xxe 


Leu 


T 1 o 

xxe 






DO o 






He 


Ser 


xxe jjys V a.x jjeu 


Trp 


Lys 






/ uu 






Lys 


Lys 


Val Ser Ser Met 


Lys 


Lys 






715 




720 


Val 


Val 


Phe Gly He Thr 


Trp 


He 




730 




735 




Asp Asp 


Ser He Arg He 


Val 


Phe 


745 




750 






Thr 


Thr' 


Gin Val Trp Cys 


Gly 


Leu 



765 



<210> 3 

<211> 891 

<212> DNA 

<213> homo sapiens 

<400> 3 

atgaacggac agttggatct aagtgggaag ctaatcatca aagctcaact tggggaggat 60 

attcggcgaa ttcctattca taatgaagat attacttatg atgaattagt gctaatgatg 120 

caacgagttt tcagaggaaa acttctgagt aatgatgaag taacaataaa gtataaagat 180 

gaagatggag atcttataac aatttttgat agttctgacc tttcctttgc aattcagtgc 240 

agtaggatac tgaaactgac attatttgga aaatctactt cctcatcaag cacccctaca 300 

gagttctgca ggaatggtgg aacctgggaa aatggcagat gtatttgtac agaagagtgg 360 

aaaggactga gatgtacaat tgctaatttt tgtgaaaata gtacctatat gggttttact 420 

tttgccagaa tcccagtggg cagatatgga ccatccttgc aaacatgtgg caaggatact 480 

ccaaatgcgg gcaatccaat ggcagtccgg ttgtgcagtc tctctctata tggagagata 540 
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gaattacaaa aagtgacaat aggaaattgc aatgaaaatc tggaaaccct ggaaaagcag 600 

gtaaaggatg tcacagcacc acttaataac atttcttctg aagtccagat tttaacatct 660 

gatgccaata aattaactgc tgagaacatc actagtgcta cgcgagtggt tggacagata 720 

ttcaacactt ccagaaatgc ttcacctgag gcaaagaaag ttgccatagt aacagtgagt 780 

caactcctag atgccagtga agatgctttt caaagagttg ctgctactgc taatgatgat . 840 

gcccttacaa cattacacca agacatgcgg ctttgtagtt tatcaaaatg a 891 

<210> 4 
<211> 296 
<212> PRT 

<213> homo sapiens 



<400> 4 






























Met 


Asn 


Gly Gin 


Leu 


Asp 


Leu 


Ser 


Gly 


Lys 


Leu 


He 


He 


Lys 


Ala 


Gin 


1 








5 










10 










15 




Leu Gly 


Glu Asp 


He 


Arg Arg 


He 


Pro 


He 


His 


Asn 


Glu 


Asp 


He 


Thr 








20 










25 










30 






Tyr Asp 


Glu 


Leu 


Val 


Leu 


Met 


Met 


Gin 


Arg 


Val 


Phe 


Arg 


Gly 


Lys 


Leu 






35 










40 










45 








Leu 


Ser 


Asn Asp 


Glu 


Val 


Thr 


He 


Lys 


Tyr 


Lys 


Asp 


Glu 


Asp 


Gly Asp 




50 










55 










60 










Leu 


He 


Thr 


He 


Phe 


Asp 


Ser 


Ser 


Asp 


Leu 


Ser 


Phe 


Ala 


He 


Gin 


Cys 


65 










70 










75 










80 


Ser 


Arg 


He 


Leu 


Lys 
85 


Leu 


Thr 


Leu 


Phe 


Gly 
90 


Lys 


Ser 


Thr 


Ser 


Ser 
95 


Ser 


Ser 


Thr 


Pro 


Thr 


Glu 


Phe 


Cys 


Arg 


Asn 


Gly 


Gly Thr 


Trp 


Glu 


Asn 


Gly 








100 










105 










110 






Arg Cys 


He 


Cys 


Thr 


Glu 


Glu 


Trp 


Lys 


Gly 


Leu Arg 


Cys 


Thr 


He 


Ala 






115 










120 










125 








Asn 


Phe 
130 


Cys 


Glu 


Asn 


Ser 


Thr 
135 


Tyr 


Met 


Gly 


Phe 


Thr 
140 


Phe 


Ala 


Arg 


He 


Pro 


Val 


Gly Arg Tyr 


Gly 


Pro 


Ser 


Leu 


Gin 


Thr 


Cys 


Gly 


Lys 


Asp 


Thr 


145 










150 










155 










160 


Pro 


Asn 


Ala Gly Asn 


Pro 


Met 


Ala 


Val 


Arg 


Leu 


Cys 


Ser 


Leu 


Ser 


Leu 










165 










170 








175 




Tyr 


Gly 


Glu 


He 
180 


Glu 


Leu 


Gin 


Lys 


Val 
185 


Thr 


He 


Gly 


Asn 


Cys 
190 


Asn 


Glu 


Asn 


Leu 


Glu Thr Leu 


Glu 


Lys 


Gin 


Val 


Lys 


Asp 


Val 


Thr 


Ala 


Pro 


Leu 






195 










200 






205 








Asn 


Asn 


He 


Ser 


Ser 


Glu 


Val 


Gin 


He 


Leu 


Thr 


Ser 


Asp 


Ala 


Asn 


Lys 




210 










215 










220 






Leu 


.Thr 


Ala 


Glu 


Asn 


He 


Thr 


Ser 


Ala 


Thr 


Arg 


Val 


Val 


Gly 


Gin 


lie 


225 










230 










235 








240 


Phe 


Asn 


Thr 


Ser 


Arg 


Asn 


Ala 


Ser 


Pro 


Glu 


Ala 


Lys 


Lys 


Val 


Ala 


He 










245 










250 






255 




Val 


Thr 


Val 


Ser 


Gin 


Leu 


Leu 


Asp 


Ala 


Ser 


Glu 


Asp 


Ala 


Phe 


Gin Arg 








260 










265 










270 






Val 


Ala 


Ala 
275 


Thr 


Ala 


Asn 


Asp 


Asp 
280 


Ala 


Leu 


Thr 


Thr 


Leu 
285 


His 


Gin 


Asp 


Met 


Arg 
290 


Leu 


Cys 


Ser 


Leu 


Ser 
295 


Lys 



















<210> 5 
<211> 2547 
<212> DNA 

<213> homo sapiens ' 
<400> 5 

atgaacggac agttggatct aagtgggaag ctaatcatca aagctcaact tggggaggat 60 
attcggcgaa ttcctattca taatgaagat attacttatg atgaattagt gctaatgatg 120 
caacgagttt tcagaggaaa acttctgagt aatgatgaag taacaataaa gtataaagat 180 
gaagatggag atcttataac aatttttgat agttctgacc tttcctttgc aattcagtgc 240 
agtaggatac tgaaactgac attatttgga aaatctactt cctcatcaag cacccctaca 300 
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gagttctgca ggaatggtgg aacctgggaa aatggcagat gtatttgtac agaagagtgg 360 

aaaggactga gatgtacaat tgctaatttt tgtgaaaata gtacctatat gggttttact 420 

tttgccagaa tcccagtggg cagatatgga ccatccttgc aaacatgtgg caaggatact 480 

ccaaatgcgg gcaatccaat ggcagtccgg ttgtgcagtc tctctctata tggagagata 540 

gaattacaaa aagtgacaat aggaaattgc aatgaaaatc tggaaaccct ggaaaagcag 600 

gtaaaggatg tcacagcacc acttaataac atttcttctg aagtccagat tttaacatct 660 

gatgccaata aattaactgc tgagaacatc actagtgcta cgcgagtggt tggacagata 720 

ttcaacactt ccagaaatgc ttcacctgag gcaaagaaag ttgccatagt aacagtgagt 780 

caactcctag atgccagtga agatgctttt caaagagttg ctgctactgc taatgatgat 840 

gcccttacaa cgcttattga gcaaatggag acttattcct tgtctttggg taatcaatca 900 

gtggtggaac ctaacatagc aatacagtca gcaaatttct cttcagaaaa tgcggtgggg 960 

ccttcaaatg ttcgcttctc tgtgcagaaa ggagctagca gttctctagt ttctagttca 1020 

acatttatac atacaaatgt ggatggcctt aacccagatg cacagactga gcttcaggtc 1080 

ttgcttaata tgacgaaaaa ttacaccaag acatgcggct ttgtagttta tcaaaatgac 1140 

aagcttttcc aatcaaaaac ttttacagct aaatcggatt ttagtcaaaa aattatctca 1200 

agcaaaactg atgaaaatga gcaagatcag agtgcttctg ttgacatggt ctttagtcca 1260 

aagtacaacc aaaaagaatt tcaactctat tcctatgcct gtgtctattg gaatttgtca 1320 

gcgaaggact gggacacata tggctgtcaa aaagacaagg gcactgatgg attcctgcgc 1380 

tgccgctgca accatactac taattttgct gtattaatga ctttcaaaaa ggattatcaa 1440 

tatcccaaat cacttgacat attatccaac gttggatgtg cactgtctgt tactggtctg 1500 

gctctcacag ttatatttca gattgtcacc aggaaagtca gaaaaacctc agtaacctgg 1560 

gttttggtca atctgtgcat atcaatgttg attttcaacc tcctctttgt gtttggaatt 1620 

gaaaactcca ataagaactt gcagacaagt gatggtgaca tcaataatat tgactttgac 1680 

aataatgaca tacccaggac agacaccatt aacatcccga atcccatgtg cactgcgatt 174 0 

gccgccttac tgcactattt tctgttagtg acatttacct ggaacgcact cagcgctgca 1800 

cagctctatt accttctaat aaggaccatg aagcctcttc ctcggcattt cattctttt.c 1860 

atctcattaa ttggatgggg agtcccagct atagtagtgg ctataacagt gggagttatt 1920 

tattctcaga atggaaataa tccacagtgg gaattagact accggcaaga gaaaatctgc 1980 

tggctggcaa ttccagaacc caatggtgtt ataaaaagtc cgctgttgtg gtcattcatc 2040 

gtacctgtaa ccattatcct catcagcaat gttgttatgt ttattacaat ctcgatcaaa 2100 

gtgctgtgga agaataacca gaacctgaca agcacaaaaa aagtttcatc catgaagaag 2160 

attgttagca cattatctgt tgcagttgtt tttggaatta cctggattct agcatacctg 2220 

atgctagtta atgatgatag catcaggatc gtcttcagct acatattctg ccttttcaac 2280 

actacacagg gattgcaaat ttttatcctg tacactgtta gaacaaaagt cttccagagt 2340 

gaagcttcca aagtgttgat gttgctatcg tctattggga gaaggaagtc attgccttca 2400 

gtgacgcggc cgaggctgcg tgtaaagatg tataatttcc tcaggtcatt gccaacctta 24 60 

catgaacgct ttaggctact ggaaacctct ccgagtactg aggaaatcac actctctgaa 2520 

agtgacaatg caaaggaaag catctag 2547 

<210> 6 
<211> 848 
<212> PRT 

<213> homo sapiens 
<400> 6 

Met Asn Gly Gin Leu Asp Leu Ser Gly Lys Leu He He Lys Ala Gin 

15 10 15 

Leu Gly Glu Asp He Arg Arg He Pro He His Asn Glu Asp He Thr 

20 25 30 

Tyr Asp Glu Leu Val Leu Met Met Gin Arg Val Phe Arg Gly Lys Leu 

35 40 .45 

Leu Ser Asn Asp Glu Val Thr He Lys Tyr Lys Asp Glu Asp Gly Asp 

50 55 60 

Leu He Thr He Phe Asp Ser Ser Asp Leu Ser Phe Ala He Gin Cys 
65 70 75 , 80 

Ser Arg He Leu Lys Leu Thr Leu Phe Gly Lys Ser Thr Ser Ser Ser 

85 90 95 

Ser Thr Pro Thr Glu Phe Cys Arg Asn Gly Gly Thr Trp Glu Asn Gly 

100 105 110 

Arg Cys He Cys Thr Glu Glu Trp Lys Gly Leu Arg Cys Thr He Ala 

115 120 125 

Asn Phe Cys Glu Asn Ser Thr Tyr Met Gly Phe Thr Phe Ala Arg He 



5 



wo 01/81411 



PCTAJSOl/13371 



Pro 


Val 




Arcf 


145 








Pro 


Asn 


ri-La 


vjxy 


Tyr Gly 




XX6 








1 fin 

J. o u 


Asn 


Leu 


Glu 


Thr 






195 




Asn 


Asn 




OCX 










Leu 


Tnr 


AX a 


(crXU 


225 








Phe 


Asn 


j.nr 


Ser 


Val 


Thr 


vax 


Ser 








du 


Val 


Ala 


Ala 


Thr 






275 




Met 


Glu 


inr 


Tyr 




290 




Asn 


He 


Ala 


lie 


305 








Pro 


Ser 


Asn 


vai 


Val 


Ser 


Ser 


Ser 








340 


Asp Ala 


Gin 


Thr 






355 




•Thr Lys 


Thr 


Cys 




370 






Ser Lys 


inr 


Jrne 


385 








Ser Lys 


inr 


Asp 


Val 


Phe 


Ser 


Pro 








420 


Ala 


Cys 


Val 


Tyr 






435 




cys 


Gin 


Lys 


Asp 




450 






His 


Thr 


inr 


Asn 


465 








Tyr 


Pro 


Lys 


Ser 


Val 


Thr 


bly 


Leu 








c n n 


Val 


Arg 


Lys 


Thr 






515 




Met 


Leu 


ixe 


Jrne 




530 






Lys 


Asn 


Leu 


bin 


545 








Asn 


Asn 


Asp 


lie 


Cys 


Thr 


nXo. 


TT 

j.xe 








t;Q n 
oo u 


Thf Trp 


Asn 


Ala 






595 




Thr 


Met 


Lys 


Pro 




610 






Gly Trp 


Gly 


Val 


625 








Tyr 


Ser 


Gin 


Asn 



lyr 




Prri 


r 

wJ C JL 




X •J 






Asn 


iri. u 




AT 

a. 


X v;>J 








uXU 


J-lC u. 


r;i n 

OXli 




Leu 


Glu 


Lys 


Gin 








200 


Ser 


Glu 


Val 


Gin 






215 




7\ c? n 




Thr 


So r 
c X 










Arg 


Asn 


an a 


oex 


z ^ o 








vjin 


Leu 


Leu 




riXo. 


Asn 




•**aP 










Ser 


Leu 


Ser 


Leu 






295 




bin 


Ser 


Axa 


Asn 




JX u 






Arg 


PVl£i 

irne 


Q ci -v- 

o ex 


vax 


•3 O C 








inr 


fne 


xxe 


JUS 


blU 


Leu 


bxn 


vax 








ODU 


Gly 


Phe 


Val 


Val 






375 




inr 


Axa 


Lys 


Ser 










oXU 


Asn 


oX U 


OXII 


40o 








Lys 


Tyr 


Asn 


bxn 


Trp 


Asn 


Leu 


C -K* 

oer 










Lys 


Gly 


Thr 


Asp 






455 




Jrne 


AX a 


vax 


Leu 




4 / u 






Leu 


Asp 


xxe 


J-ieu 










AXa 


Leu 


Th-r 
X IIX 


Va 1 
V cLX 


C c> Y> 
OCX 


V a.x 


Thr 












Asn 


Leu 


Leu 


Phe 






535 




Trir 


OC7X 




oxy 




OO U 






iri. 








O DO 








AX a 


2X1 a 


Leu 


Leu 


Leu 


Ser 


Ala 


Ala 








600 


Leu 


Pro 


Arg 


His 






615 




Pro 


Ala 


He 


Val 




630 






Gly 


Asn 


Asn 


Pro 



Leu 


Gin 


Thr 


Cys 






155 




Val 


Arg 


Leu 


Cys 




170 






Val 


Thr 


He 


Glv 


185 








Val 


Lys 


Asp 


Val 


lie 


Leu 


Thr 


Ser 








220 


Ala 


Thr 


Arg 


Val 






235 




Pro 


Glu 


Ala 






9sn 






Ala 


OCX 


^3X11 












Al a 


Leu 


Thr 


Thr 


Vjxy 


A cm 




OCX 








o u u 


Phe 


Ser 


Ser 


Glu 






315 








f^l \T 


Al p 




JOU 






i nr 


2X c n 


\7"a 1 
V dX 












Leu 


Leu 






Tyr 


ril n 

oxn 


ZX cn 


2X er^ 








O 0 u 


Asp 


Phe 


Ser 


Gin 






395 




2X C7~i 




OCX. 


Al p 
rax d 




*l xu 






Lys 


d 11 

uXU. 


PHo 
irne 


uxn 


/ 9 c; 








7X1 a 

AX a 


Lys 


Asp 


irp 


bxy 


Jrne 


Leu 


Arg 








ft DU 


Met 


Thr 


Phe 


Lys 






475 




Ser 


Asn 


\7"a 1 
V dX 


fil w 
oxy 




A on 






Tl *=> 
XXc 


Phfi 
jtriic 




Tip 
J. xc 










vax 


Leu 


Va 1 
V ax 


ZX G n 


Val 

V d-L 


Phe 


Gly 


He 








540 


Asp 


He 


Asn 


Asn 






555' 




Thr 


Tip 

J- -L C 




Tl p 
J. xc 




•J f \j 






• riXo 


iyr 


Php 


iJC u. 










Gin 


Leu 


Tyr 


Tyr 


Phe 


He 


Leu 


Phe 








620 


Val 


Ala 


He 


Thr 






635 




Gin 


Trp 


Glu 


Leu 



Glv 

vjx_y 


Lys 


Asp 


Thr 








160 


Ser 


Leu 


Ser 


Leu 






175 




Asn 


Cys 


Asn 


Glu 




190 






Thr 


Ala 


Pro 


Leu 


205 








Asp 


Ala 


Asn 


Lvs 


Val 


vj-L jr 


Gin 


He 








240 


Lys 


Val 


Ala 


He 




255 




Al a 


Php 




Arrr 
nx y 




970 






Leu 


Tip 

X Xc 


ox u. 


f;i n 


9fl R 








V ax 


Val 

V dX 


Glu 


Pro 


Asn 


Ala 


Val 


Gly 








320 

O^ 


Ser 


Ser 


Ser 


Leu 






335 




nl w 
(jxy 


Leu 


A Qn 


Pm 
r^x 










1 nx 


Lys 




iyr 


o OO 








j-iy ^ 


xicu 


Php 


Gin 


J is 


Tl p 
xxe 


He 


Ser 








4 00 

*i U Vy/ 


Ser 


Val 


Asp 


Met 






415 




Leu 


lyr 


^pr 

OCX 


Tvr 










ZX ar^ 


JL XIX 


iyx 


vjxy 












ZX "rrr 
x-iX ^ 






Lys 


ZX QT^ 


iyx 


f;i n 








O \J 


Cys 


Ala 


Leu 


Ser 






495 




Val 
V dx 


Thr 

X IXX 


Arg 






•^1 n 

*J X 






Leu 




Tl P 
xxe 


^pr 
o cx 


R9 S 








CI n 

OX Ll 


A QTl 


^pr 

OCX 




He 


Asp 


Phe 


Asp 








560 


Pro 


Asn 


Pro 


Met 






575 




Leu 


Val 


Thr 


Phe 




'590 






Leu 


Leu 


He 


Arg 


605 








He 


Ser 


Leu 


He 


Val 


Gly 


Val 


He 






640 


Asp 


Tyr 


Arg 


Gin 
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645 650 655 



Glu 


Lys 


He 


Cys 
660 




Leu 


Ala 


He 


Pro 
665 


Glu 


Pro 


Asn 


Gly 


Val 
670 


He 


Lys 


Ssx" 


Pro 


Leu 


Leu 




Ser 


Phe 


He 


Val 


Pro 


Val 


Thr 


He 


He 


• Leu 


He 






675 








680 










685 










Asn 
690 


Val 


Val 


Met 


Phe 


He 
695 


Thr 


He 


Ser 


He 


Lys 
700 


Val 


Leu 


Tro 


Lys 


Asn 


Asn 


Gin 


Asn 


Leu 


Thr 


Ser 


Thr 


Lvs 

u 


Lys 


Val 


Ser 


Ser 


Met 


Lys 


Lys 


705 










710 










715 










720 


Tip 

J- -L C; 


Val 


Ser 


Thr 


Leu 
725 


Ser 


Val 


Ala 


Val 


Val 
730 


Phe 


Gly 


He 


Thr 


735 


He 


J_lC u 


Ala 


lyi: 


Leu 
740 




Leu 


Val 


Asn 


Asp 
745 


Asp 


Ser 


He 


Arg 


He 
750 


Val 


Phe 


Q o r" 


T\/T 

j.yr 








T.011 

XJtJ Li 


Phe 


Asn 


± ilX 


Thr* 

X liX 


Gin Gly 


XIC ul 


m n 


He 


Phe 






755 










760 










765 










770 


xyr 


Thr 


V ciX 


r^X ^ 


Thr 
775 


Lys 


V dx 


IT lie? 


Gin 


Ser 
780 


ox LI 


Ala 


OCX 


XJj^ o 


Val 


Leu 


Met 


Leu 


Leu 


Ser 


Ser 


He 


Gly 


Arg 


Arg Lys 


Ser 


Leu 


Pro 


Ser 


785 










790 










795 










800 


Val 


Thr 


Arg 


Pro 


Arg 


Leu 


Arg Val 


Lys 


Met 


Tyr Asn 


Phe 


Leu 


Arg 


Ser 










805 










810 










815 




Leu 


Pro 


Thr 


Leu 
820 


His 


Glu 


Arg 


Phe 


Arg 
825 


Leu 


Leu 


Glu 


Thr 


Ser 
830 


Pro 


Ser 


Thr 


Glu 


Glu 


He 


Thr 


Leu 


Ser 


Glu 


Ser 


Asp 


Asn 


Ala 


Lys 


Glu 


Ser 


He 



835 840 845 

<210> 7 

<211> 1446 

<212> DNA 

<213> homo sapiens 



<400> 7 

atggagactt attccttgtc tttgggtaat caatcagtgg tggaacctaa catagcaata 60 

cagtcagcaa atttctcttc agaaaatgcg gtggggcctt caaatgttcg cttctctgtg 120 

cagaaaggag ctagcagttc tctagtttct agttcaacat ttatacatac aaatgtggat 180 

ggccttaacc cagatgcaca gactgagctt caggtcttgc ttaatatgac gaaaaattac 240 

accaagacat gcggctttgt agtttatcaa aatgacaagc ttttccaatc aaaaactttt 300 

acagctaaat cggattttag tcaaaaaatt atctcaagca aaactgatga aaatgagcaa 360 

gatcagagtg cttctgttga catggtcttt agtccaaagt acaaccaaaa agaatttcaa 420 

ctctattcct atgcctgtgt ctattggaat ttgtcagcga aggactggga cacatatggc 480 

tgtcaaaaag acaagggcac tgatggattc ctgcgctgcc gctgcaacca tactactaat 540 

tttgctgtat taatgacttt caaaaaggat tatcaatatc ccaaatcact tgacatatta 600 

tccaacgttg gatgtgcact gtctgttact ggtctggctc tcacagttat atttcagatt 660 

gtcaccagga aagtcagaaa aacctcagta acctgggttt tggtcaatct gtgcatatca 720 

atgttgattt tcaacctcct ctttgtgttt ggaattgaaa actccaataa gaacttgcag 780 

acaagtgatg gtgacatcaa taatattgac tttgacaata atgacatacc caggacagac 840 

accattaaca tcccgaatcc catgtgcact gcgattgccg ccttactgca ctattttctg 900 

ttagtgacat ttacctggaa cgcactcagc gctgcacagc tctattacct tctaataagg 960 

accatgaagc ctcttcctcg gcatttcatt cttttcatct cattaattgg atggggagtc 1020 

ccagctatag tagtggctat aacagtggga gttatttatt ctcagaatgg aaataatcca 1080 

cagtgggaat tagactaccg gcaagagaaa atctgctggc tggcaattcc agaacccaat 1140 

ggtgttataa aaagtccgct gttgtggtca ttcatcgtac ctgtaaccat tatcctcatc 1200 

agcaatgttg ttatgtttat tacaatctcg atcaaagtgd tgtggaagaa taaccagaac 1260 

ctgacaagca caaaaaaagt ttcatccatg aagaagattg ttagcacatt atctgttgca 1320 

gttgtttttg gaattacctg gattctagca tacctgatgc tagttaatga tgatagcatc 1380 

aggatcgtct tcagctacat attctgcctt ttcaacacta* cacaggtatg gtgcggatta 14 40 

gtctga 144 6 



<210> 8 
<211> 481 
<212> PRT 

<213> homo sapiens 
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<400> 8 





O J- LL 


Thr Tyr 


Ser 
5 


Leu 


Ser 


J-J c u. 




1 

Asn 


He 


Ala 


He 


Gin 


Ser 


Ala 


Asn 


Phe 








20 










25 


Pro 


Ser 


Asn Val Arg Phe 


Ser 


Val 


Gin 






35 










40 




Val 


Ser 


Ser 


Ser 


Thr 


Phe 


He 


His 


Thr 




50 










55 






Asp Ala 


Gin 


Thr 


Glu 


Leu 




V CLX 




65 










70 








Thr 


Lys 


Thr 


Cys 


Gly 


Phe 


Val 

V CL-1. 


V d J_ 


lyr 










85 










Ser 


Lys 


Thr 


Phe 


Thr 


Ala 


j-ty o 


Qor" 










100 










105 


Ser Lys 


Thr Asp 


Glu 


Asn 




fz^ n 


Asp 






115 










1 90 




Val 


Phe 


Ser 


Pro 


Lys 


Tyr 




(Z^ n 

oxn 


Lys 




130 










1 "^R 






Ala 


Cys 


Val 


Tyr 


Trp Asn 


Leu 


Ser 


rxXa 


145 










150 








Cys 


Gin 


Lys 


Asp 


Lys 


Gly 


i nr 


Asp 


oxy 










165 










His 


Thr 


Thr 


Asn 


Phe 


Ala 


vajL 


Leu 


Met 








180 










loo 


Tyr 


Pro 


Lys 


Ser 


Leu 


Asp 


j.±e 


Leu 


Ser 






195 










9 nn 
z uu 




Val 


Thr 


Gly Leu Ala 


Leu 


Tnr 


vaJL 


T 1 ^ 

-Lie 




210 










Z X o 






Val 


Arg 


Lys 


Thr 


Ser 


Val 


inr 


Trp 


vax 


225 










230 








Met 


Leu 


He 


Phe 


Asn 


Leu 


Leu 


rne 


vax 










245 










Lys 


Asn 


Leu 


Gin 


Thr 


Ser 


Asp 


b±y 


Asp 








260 










O C Q 


Asn 


Asn 


Asp 


He 


Pro Arg 


Tnr 


Asp 


Thr 






275 










9Q n 




Cys 


X nr 


Ala 


He 


Ala 


Ala 


Leu 


Leu 


nis 




290 










O Q 






Thr 


Trp 


Asn 


Ala 


Leu 


Ser 


Axa 


Ax a 


Ljxn 


305 










310 








Thr 


Met 


Lys 


Pro 


Leu 


Pro 


Arg 


JtllS 


irne 










325 










Gly 


Trp 


Gly Val 


Pro 


Ala 


xxe 


vax 


vax 








340 










JhD 


Tyr 


Ser 


Gin 


Asn 


Gly Asn 


Asn 


rrro 








355 














Glu 


Lys 


He 


Cys 


Trp 


Leu 




XXc 


JtrX 




370 










•^7 R 






Ser 


Pro 


Leu 


Leu 


Trp 


Ser 




Tip 

JL X t: 


V Cl X 


385 








390 








Ser 


Asn 


Val 


Val 


Met 


Phe 


X J. c> 




Tl P 

X X 










405 










Asn 


Asn 


Gin 


Asn 


Leu 


Thr 


Ser 


Thr 


Lys 








420 










425 


lie 


Val 


Ser 


Thr 


Leu 


Ser 


Val 


Ala 


Val 






435 










440 




Leu 


Ala 


Tyr 


Leu 


Met 


Leu 


Val 


Asn 


Asp 




450 










455 






Ser 


Tyr 


He 


Phe 


Cys 


Leu 


Phe 


Asn 


Thr 



465 470 
Val 
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Asn 


Gin 


Ser 


Val 


Val 


Glu 


Pro 


10 










15 




Ser 


Ser 


Glu 


Asn 


Ala 


Val 


Gly 










30 






L\79 


■ Gly 


Ala 


Ser 


Ser 


Ser 


Leu 








45 








Asn 


Val 


Asp 


Gly 


Leu 


Asn 


Pro 






60 










TiPTl 


Asn 


Met 


Thr 


Lys 


Asn 


Tvr 
•L yx 




75 










80 










JJ c u. 


Php 

r lie 


m n 

uXli 


90 










95 






Q o V 
OcX 


ai n 


Xiy o 


Tl P 

X XC; 


Tl P 
X xc 


OCX 










110 






oXil 


OCX 


Ala 

AXo 


OCX 


V ctx 




L 








125 








OXU 


Php 
rne 


oxn 


Leu 


lyr 


oex 


Tt7 V 

lyr 






X ^ Lf 










Jjys 


Asp 


irp 


Asp 


1 iiX 


iyr 


oxy 




1 RR 










1 fin 

X D U 


irne 


Leu 


7\ -V- ^ 

fix y 


Pt/c 

v^y£> 


7\ y~ /~T 

rlX y 


uys 


A c i~i 


1 7 n 
X / u 










1 7 R 
X / D 




Jinr 


rne 


Lys 


Lys 


Asp 


Tyr 


P 1 rt 

oxn 










1 QO 

x;7U 






Asn 


vax 


LrXy 


Cys 


Ala 

AX a 


Leu 


Ser 
















rne 


uxn 


xxe 


vax 


1 nr 


Arg 


Lys 






9 9 A 
ZZU 










Leu 


vax 


Asn 


Leu 


L^ys 


xxe 


oer 




O "5 R 












rne 


uxy 


xxe 


Pill 

oXU 


Asn 


Ser 


Asn 


9 Rn 














T 1 A 

xxe 


Asn 


Asn 


xxe 


Asp 


rne 


Asp 










z / u 






xxe 


Asn 


T 1 Q 

xxe 


Pro 


Asn 


Pro 


Me L 








O Q ^ 








Tyr 


rne 


Leu 


Leu 


vax 


1 nr 


Dho 

rne 














Leu 


Tyr 


Tyr 


Leu 


Leu 


T 1 o 

xxe 


Arg 
















xxe 


Leu 


rfie 


T 1 o 

xxe 


oer 


Leu 


Tip 

xxe 












•5 "3 c; 




r\xa 


T 1 o 

xxe 


1 nr 


vax 


PI 
oxy 


V ax 


T 1 0 

xxe 










J J y 






irp 


OX LI 


jjeu 


H C3r^ 
riop 


iyr 


rixy 


ni n 








365 








oX U 


Pm 
r X u 




oxy 


Va 1 
V dX 


Tip 

xxe 


xiy 0 






380 










r X c 


V ctx 


Thr- 

J. iXX 


Tip 
xxe 


Tl P 
X X c; 


T.pn 


T 1 P 
xxe 




395 










400 


kJOX 


Tie 

xxe 




V dx 


T.pn 

XJC LL 


T 

irp 


T.\7Q 

xiy 0 


410 










415 






Val 




C Q y 

oex 




XI u 


xiy 0 










430 






Val 


Phe 


Gly 


He 


Thr 


Trp 


He 








445 








Asp 


Ser 


He 


Arg 


He 


Val 


Phe 






460 










Thr 


Gin 


Val 


Trp 


Cys 


Gly 


Leu 




475 










480 
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<210> 9 
<211> 1272 
<212> DNA 

<213> homo sapiens 
<400> 9 

atgccagtga agatgctttt caaagagttg ctgctactgc taatgatgat gcccttacaa 60 

cattacacca agacatgcgg ctttgtagtt tatcaaaatg acaagctttt ccaatcaaaa 120 

acttttacag ctaaatcgga ttttagtcaa' aaaattatct caagcaaaac tgatgaaaat 180 

gagcaagatc agagtgcttc tgttgacatg gtctttagtc caaagtacaa ccaaaaagaa 240 

tttcaactct attcctatgc ctgtgtctat tggaatttgt cagcgaagga ctgggacaca * 300 

tatggctgtc aaaaagacaa gggcactgat ggattcctgc gctgccgctg caaccatact 360 

actaattttg ctgtattaat gactttcaaa aaggattatc aatatcccaa atcacttgac 420 

atattatcca acgttggatg tgcactgtct gttactggtc tggctctcac agttatattt 480 

cagattgtca ccaggaaagt cagaaaaacc tcagtaacct gggttttggt caatctgtgc 540 

atatcaatgt tgattttcaa cctcctcttt gtgtttggaa ttgaaaactc caataagaac 600 

ttgcagacaa gtgatggtga catcaataat attgactttg acaataatga catacccagg 660 

acagacacca ttaacatccc gaatcccatg tgcactgcga ttgccgcctt actgcactat 720 

tttctgttag tgacatttac ctggaacgca ctcagcgctg cacagctcta ttaccttcta 780 

ataaggacca tgaagcctct tcctcggcat ttcattcttt tcatctcatt aattggatgg 840 

ggagtcccag ctatagtagt ggctataaca gtgggagtta tttattctca gaatggaaat 900 

aatccacagt gggaattaga ctaccggcaa gagaaaatct gctggctggc aattccagaa 960 

cccaatggtg ttataaaaag tccgctgttg tggtcattca tcgtacctgt aaccattatc 1020 

ctcatcagca atgttgttat gtttattaca atctcgatca aagtgctgtg gaagaataac 1080 

cagaacctga caagcacaaa aaaagtttca tccatgaaga agattgttag cacattatct 1140 

gttgcagttg tttttggaat tacctggatt ctagcatacc tgatgctagt taatgatgat 1200 

agcatcagga tcgtcttcag ctacatattc tgccttttca acactacaca ggtatggtgc 1260 

ggattagtct ga 1272 

<210> 10 
<211> 423 
<212> PRT 

<213> homo sapiens 



<400> 10 



Met Pro Val 


Lys 


Met 


Leu 


Phe 


Lys 


Glu 


Leu 


Leu 


Leu 


Leu 


Leu Met 


Met 


1 




5 










10 








15 




Met Pro Leu 


Gin 


His 


Tyr 


Thr 


Lys 


Thr 


Cys 


Gly 


Phe 


Val 


Val Tyr 


Gin 




20 










25 










30 




Asn Asp Lys 


Leu 


Phe 


Gin 


Ser 


Lys 


Thr 


Phe 


Thr 


Ala 


Lys 


Ser Asp 


Phe 


35 










40 










45 






Ser Gin Lys 


He 


He 


Ser 


Ser 


Lys 


Thr 


Asp 


Glu 


Asn 


Glu 


Gin Asp 


Gin 


50 








55 










60 








Ser Ala Ser 


Val 


Asp 


Met 


Val 


Phe 


Ser 


Pro 


Lys 


Tyr 


Asn 


Gin Lys 


Glu 


65 






70 










75 








80 


Phe Gin Leu 


Tyr 


Ser 


Tyr 


Ala 


Cys 


Val 


Tyr 


Trp 


Asn 


Leu 


Ser Ala 


Lys 






85 










90 








95 




Asp Trp Asp 


Thr 


Tyr 


Gly 


Cys 


Gin 


Lys 


Asp 


Lys 


Gly 


Thr 


Asp Gly 


Phe 




100 










105 










110 




Leu Arg Cys 


Arg 


Cys 


Asn 


His 


Thr 


Thr 


Asn 


Phe 


Ala 


Val 


Leu Met 


Thr 


115 










120 










125 






Phe Lys Lys 


Asp 


Tyr 


Gin 


Tyr 


Pro 


Lys 


Ser 


Leu Asp 


He 


Leu Ser 


Asn 


130 








135 










140 








Val Gly Cys Ala 


Leu 


Ser 


Val 


Thr 


Gly 


Leu 


Ala 


Leu 


Thr 


Val He 


Phe 


145 






150 










155 








160 


Gin He Val 


Thr 


Arg 


Lys 


Val 


Arg 


Lys 


Thr 


Ser 


Val 


Thr 


Trp Val 


Leu 






165 










170 








175 




Val Asn Leu 


Cys 


He 


Ser 


Met 


Leu 


He 


Phe 


Asn 


Leu 


Leu 


Phe Val 


Phe 




180 










185 










190 




Gly He Glu 


Asn 


Ser 


Asn 


Lys 


Asn 


Leu 


Gin 


Thr 


Ser 


Asp 


Gly Asp 


He 


195 










200 










205 






Asn Asn He 


Asp 


Phe 


Asp 


Asn 


Asn 


Asp 


He 


Pro 


Arg 


Thr 


Asp Thr 


He 
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210 215 220 



Asn 


lie 


Pro 


Asn 


Pro 


jxiet 


uys 


inr 


a 1 =a 

Hxa 


lie 


riJ.o 


Ala 


Leu 


Leu 


nxo 


iyr 


OOP 










9 "^n 










235 










94 0 


Pne 


Leu 


Leu 


vaj. 


inr 


xrne 


1 nr 


irp 


Asn 


Ala 
250 


Leu 


Ser 


Ala 


Ala 


C^ n 


Leu 


Tyr 


Tyr 


lieu 


jjeu 


iie 


Arg 


Tnr 


Kiet 


Lys 


Pro 


Leu 


Pro Arg His 


xrne 


T 1 














zoo 










0 T n 

210 






Leu 


Pne 


lie 
275 


Ser 


Leu 


ixe 


b±y 


Trp 
280 


Liiy 


Val 


Pro 


Ala 


He 
285 


Val 




rixa 


lie 


Thr 


Val 


Gly 


Val 


xie 


Tyr 


ber 


bin 


Asn Gly 


Asn 


Asn 


Pro 




Trp 




290 








295 










300 










Glu 


Leu 


Asp 


Tyr 


Arg 


Gin 


blU 


Lys 


lie 


Cys 


Trp 


Leu 


Ala 


He 


Pro 


olU 


305 










310 










315 












Pro 


Asn 


Gly 


Val 


He 


T • m 

Lys 


Ser 


Pro 


Leu 


Leu 


Trp 


Ser 


rne 


He 


vai 


Pro 








0 0 c 








330 










335 




Val 


Thr 


He 


He 
340 


Leu 


He 


Ser 


Asn 


Val 
345 


Val 


Met 


Phe 


He 


Thr 
350 


He 


Ser 


He 


Lys 


Val 

Q c c 


Leu 


Trp 


Lys 


Asn 


Asn 


Gin 


Asn 


Leu 


Thr 


Ser 
365 


Thr 


Lys 


Lys 


Val 


Ser 


Ser 


Met 


Lys 


Lys 


He 


Val 


Ser 


Thr 


Leu 


Ser 


Val 


Ala 


Val 


Val 




370 








375 










380 










Phe 


Gly 


He 


Thr 


Trp 


He 


Leu 


Ala 


Tyr 


Leu 


Met 


Leu 


Val 


Asn Asp 


Asp 


385 










390 










395 










400 


Ser 


He 


Arg 


He 


Val 
405 


Phe 


Ser 


Tyr 


He 


Phe 
410 


Cys 


Leu 


Phe 


Asn 


Thr 
415 


Thr 


Gin 


Val 


Trp 


Cys 


Gly 


Leu 


Val 





















420 



<210> 11 
<211> 1683 
<212> DNA 

<213> homo sapiens 
<400> 11 

atggagactt attccttgtc tttgggtaat caatcagtgg tggaacctaa catagcaata 60 

cagtcagcaa atttctcttc agaaaatgcg gtggggcctt caaatgttcg cttctctgtg 120 

cagaaaggag ctagcagttc tctagtttct agttcaacat ttatacatac aaatgtggat 180 

ggccttaacc cagatgcaca gactgagctt caggtcttgc ttaatatgac gaaaaattac 24 0 

accaagacat gcggctttgt agtttatcaa aatgacaagc ttttccaatc aaaaactttt 300 

acagctaaat cggattttag tcaaaaaatt atctcaagca aaactgatga aaatgagcaa 360 

gatcagagtg cttctgttga catggtcttt agtccaaagt acaaccaaaa agaatttcaa , 420 

ctctattcct atgcctgtgt ctattggaat ttgtcagcga aggactggga cacatatggc 480 

tgtcaaaaag acaagggcac tgatggattc ctgcgctgcc gctgcaacca tactactaat 540 

tttgctgtat taatgacttt caaaaaggat tatcaatatc ccaaatcact tgacatatta 600 

tccaacgttg gatgtgcact gtctgttact ggtctggctc tcacagttat atttcagatt 660 

gtcaccagga aagtcagaaa aacctcagta acctgggttt tggtcaatct gtgcatatca 720 

atgttgattt tcaacctcct ctttgtgttt ggaattgaaa actccaataa gaacttgcag 780 

acaagtgatg gtgacatcaa taatattgac tttgacaata atgacatacc caggacagac 840 

accattaaca tcccgaatcc catgtgcact gcgattgccg ccttactgca ctattttctg 900 

ttagtgacat ttacctggaa cgcactcagc gctgcacagc tctattacct tctaataagg 960 

accatgaagc ctcttcctcg gcatttcatt cttttcatct cattaattgg atggggagtc 1020 

ccagctatag tagtggctat aacagtggga gttatttatt ctcagaatgg aaataatcca 1080 

cagtgggaat tagactaccg gcaagagaaa atctgctggc tggcaattcc agaacccaat 1140 

ggtgttataa aaagtccgct gttgtggtca ttcatcgtac ctgtaaccat tatcctcatc 1200 

agcaatgttg ttatgtttat tacaatctcg atcaaagtgc tgtggaagaa taaccagaac 1260 

ctgacaagca caaaaaaagt ttcatccatg aagaagattg ttagcacatt atctgttgca 1320 

gttgtttttg gaattacctg gattctagca tacctgatgc tagttaatga tgatagcatc 1380 

aggatcgtct tcagctacat attctgcctt ttcaacacta cacagggatt gcaaattttt 1440 

atcctgtaca ctgttagaac aaaagtcttc cagagtgaag cttccaaagt gttgatgttg 1500 

ctatcgtcta ttgggagaag gaagtcattg ccttcagtga cgcggccgag gctgcgtgta 1560 

aagatgtata atttcctcag gtcattgcca accttacatg aacgctttag gctactggaa 1620 

acctctccga gtactgagga aatcacactc tctgaaagtg acaatgcaaa ggaaagcatc 1680 

tag 1683 
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<210> 12 
<211> 560 
<212> PRT 

<213> homo sapiens 



<400> 12 






















TV* r* 
JL Lll. 


iyr 


Q /-> >- 


ijeu 


C T" 


Leu Gly Asn 


Gin Ser Val Val Glu 


Pt"o 








0 










10 


15 




Asn lie 




He 


uj-n 


Ser 


Ala 


Asn 


Phe 


Ser 


Ser Glu Asn Ala Val 


ox Y 
















25 




30 




Pro Ser 


Asn 


vax 


Arg 


Phe 


Ser 


Val 


Gin 


Lys 


Gly Ala Ser Ser Ser 


_ 

Jbeu 














40 






45 




Val Ser 


Ser 


Ser 


1 nr 


Phe 


He 


His 


Thr Asn Val Asp Gly Leu Asn 


CX O 












55 








60 




Asp Ala 


Gin 


mr 




Leu 


Gin 


Val 


Leu 


Leu 


Asn Met Thr Lys Asn 


lyr 


65 








70 










75 


oU 


Thr Lys 


Thr 


Cys 


Gly 


Phe 


Val 


Val 


Tyr Gin Asn Asp Lys Leu Phe 


CjJLn 








85 










.90 


95 




Ser Lys 


Thr 


Phe 


Thr 


Ala 


Lys 


Ser Asp 


Phe 


Ser Gin Lys He He 


Ser 






100 










105 




110 




Ser Lys 


Thr 


Asp 


Glu 


Asn Glu Gin Asp Gin Ser Ala Ser Val Asp 


Met 




115 








120 






125 




Val Phe 


Ser 


Pro 


Lys 


Tyr Asn 


Gin 


Lys 


Glu 


Phe Gin Leu Tyr Ser 


Tyr 


130 










135 








140 




Ala Cys 


Val 


Tyr 


Trp 


Asn 


Leu 


Ser 


Ala 


Lys 


Asp Trp Asp Thr Tyr 


Gly 


145 








150 










155 


160 


Cys Gin 


Lys 


Asp 


Lys 


Gly Thr Asp Gly Phe Leu Arg Cys Arg Cys 


Asn 








165 










170 


175 




His Thr 


Thr 


Asn 


Phe 


Ala 


Val 


Leu 


Met 


Thr 


Phe Lys Lys Asp Tyr 


Gin 






180 










185 




190 




Tyr Pro 


Lys 


Ser 


Leu 


Asp He Leu Ser Asn Val Gly Cys Ala Leu 


Ser 




195 










200 






205 




Val Thr 


Gly 


Leu 


Ala 


Leu 


Thr 


Val 


He 


Phe 


Gin He Val Thr Arg 


Lys 


210 








215 








220 




Val Arg 


Lys 


Thr 


Ser 


Val 


Thr Trp Val 


Leu 


Val Asn Leu Cys He 


Ser 


225 








230 










235 


240 


Met Leu 


lie 


Phe 


Asn 


Leu 


Leu 


Phe 


Val 


Phe Gly He Glu Asn Ser 


Asn 








245 










250 


255 




Lys Asn 


Leu 


Gin 


Thr 


Ser Asp Gly Asp 


He 


Asn Asn He Asp Phe 


Asp 






260 










265 




270 




Asn Asn 


Asp 


He 


Pro 


Arg Thr Asp Thr 


He 


Asn He Pro Asn Pro 


Met 




275 










280 






285 




Cys Thr 


Ala 


He 


Ala 


Ala 


Leu 


Leu 


His 


Tyr 


Phe Leu Leu Val Thr 


rne 












295 








300 




inr irp 


Asn 


Ala 


Lieu 


Ser 


Ala 


Ala 


Gin 


Leu 


Tyr Tyr Leu Leu He 


Arg 










310 










315 




inr Met: 


Lys 


Pro 


Leu 


Pro Arg 


His 


Phe 


He 


Leu Phe He Ser Leu 


X jie 






o o c 










330 


335 




oiy 1 rp 


uj.y 


vaj. 


Pro 


Ala 


He 


Val 


Val 


Ala 


He Thr Val Gly Val 


J. j.e 






o4 U 










345 




350 




Tyr Ser 


Gin 


Asn 


Gly 


Asn 


Asn 


Pro 


Gin 


Trp 


Glu Leu Asp Tyr Arg 


Gin 




355 










360 






365 




Glu Lys 


He 


Cys 


Trp 


Leu 


Ala 


He 


Pro 


Glu 


Pro Asn Gly Val He 


Lys 


370 










375 








380 




Ser Pro 


Leu 


Leu 


Trp 


Ser 


Phe 


He 


Val 


Pro 


Val Thr He He Leu 


He 


385 






390 










395 


400 


Ser Asn 


Val 


Val 


Met 


Phe 


He 


Thr 


He 


Ser 


He Lys Val Leu Trp 


Lys 








405 










410 


415 




Asn Asn 


Gin 


Asn' 


Leu 


Thr 


Ser 


Thr 


Lys 


Lys 


Val Ser Ser Met Lys 


Lys 






420 










425 




430 




lie Val 


Ser 


Thr 


Leu 


Ser 


Val 


Ala 


Val 


Val 


Phe Gly He Thr Trp 


He 




435 










440 






445 
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Leu 


Ala 


iyr 


Leu 




450 






Ser Tyr 


Tl (a 
-L -L C 




A cue, 










Leu 


lyr 




Val 


Leu 


lie L. 












Val 


Thr 


Arg 


Pro 






515 




Leu 


Pro 


Thr 


Leu 




530 






Thr 


Glu 


Glu 


He 


545 









Met 


Leu 


Val 


Asn 






455 




Cys 


Leu 


Phe 


Asn 


470 






Val 


Arg 


Thr 


Lys 


485 








JJC Ll 


OCX 




He 

~L -L C 


Arg 


Leu 


Arg 


Val 








520 


His 


Glu 


Arg 


Phe 






535 




Thr 


Leu 


Ser 


Glu 



550 



Asp 


Asp 


Ser 


He 








460 


Thr 


Thr 


Gin 


Gly 






475 




Val 


Phe 


Gin 


Ser 




490 






Gly 


Arg 


Arg 


Lys 


505 








Lys 


Met 


Tyr 


Asn 


Arg 


Leu 


Leu 


Glu 








540 


Ser 


Asp 


Asn 


Ala 



555 



Arg 


He 


Val 


Phe 


Leu 


Gin 


He 


Phe 








480 


Glu 


Ala 


Ser 


Lys 






4 95 




Ser 


Leu 


Pro 


Ser 




510 






Phe 


Leu 


Arg 


Ser 


525 








Thr 


Ser 


Pro 


Ser 


Lys 


Glu 


Ser 


He 








560 



<210> 13 

<211> 1509 

<212> DNA 

<213> homo sapiens 



<400> 13 

atgccagtga 

cattacacca 

acttttacag 

gagcaagatc 

tttcaactct 

tatggctgtc 

actaattttg 

atattatcca 

cagattgtca 

atatcaatgt 

ttgcagacaa 

acagacacca 

tttctgttag 

ataaggacca 

ggagtcccag 

aatccacagt 

cccaatggtg 

ctcatcagca 

cagaacctga 

gttgcagttg 

agcatcagga 

atttttatcc 

atgttgctat 

cgtgtaaaga 

ctggaaacct 

agcatctag 



agatgctttt 
agacatgcgg 
ctaaatcgga 
agagtgcttc 
attcctatgc 
aaaaagacaa 
ctgtattaat 
acgttggatg 
ccaggaaagt 
tgattttcaa 
gtgatggtga 
ttaacatccc 
tgacatttac 
tgaagcctct 
ctatagtagt 
gggaattaga 
ttataaaaag 
atgttgttat 
caagcacaaa 
tttttggaat 
tcgtcttcag 
tgtacactgt 
cgtctattgg 
tgtataattt 
ctccgagtac 



caaagagttg 

ctttgtagtt 

ttttagtcaa 

tgttgacatg 

ctgtgtctat 

gggcactgat 

gactttcaaa 

tgcactgtct 

cagaaaaacc 

cctcctcttt 

catcaataat 

gaatcccatg 

ctggaacgca 

tcctcggcat 

ggctataaca 

ctaccggcaa 

tccgctgttg 

gtttattaca 

aaaagtttca 

tacctggatt 

ctacatattc" 

tagaacaaaa 

gagaaggaag 

cctcaggtca 

tgaggaaatc 



ctgctactgc 
tatcaaaatg 
aaaattatct 
gtctttagtc 
tggaatttgt 
ggattcctgc 
aaggattatc 
gttactggtc 
tcagtaacct 
gtgtttggaa 
attgactttg 
tgcactgcga 
ctcagcgctg 
ttcattcttt 
gtgggagtta 
gagaaaatct 
tggtcattca 
atctcgatca 
tccatgaaga 
ctagcatacc 
tgccttttca 
gtcttccaga 
tcattgcctt 
ttgccaacct 
acactctctg 



taatgatgat 
acaagctttt 
caagcaaaac 
caaagtacaa 
cagcgaagga 
gctgccgctg 
aatatcccaa 
tggctctcac 
gggttttggt 
ttgaaaactc 
acaataatga 
ttgccgcctt 
cacagctcta 
tcatctcatt 
tttattctca 
gctggctggc 
tcgtacctgt 
aagtgctgtg 
agattgttag 
tgatgctagt 
acactacaca 
gtgaagcttc 
cagtgacgcg 
tacatgaacg 
aaagtgacaa 



gcccttacaa 
ccaatcaaaa 
tgatgaaaat 
ccaaaaagaa 
ctgggacaca 
caaccatact 
atcacttgac 
agttatattt 
caatctgtgc 
caataagaac 
catacccagg 
actgcactat 
ttaccttcta 
aattggatgg 
gaatggaaat 
aattccagaa 
aaccattatc 
gaagaataac 
cacattatct 
taatgatgat 
gggattgcaa 
caaagtgttg 
gccgaggctg 
ctttaggcta 
tgcaaaggaa 



<210> 14 
<211> 502 
<212> PRT 

<213> homo sapiens 
<400> 14 

Met Pro Val Lys Met Leu Phe Lys Glu Leu Leu Leu Leu Leu Met Met 

1 5 10 15 

Met Pro Leu Gin His Tyr Thr Lys Thr Cys Gly Phe Val Val Tyr Gin 

20 25 30 

Asn Asp Lys Leu Phe Gin Ser Lys Thr Phe Thr Ala Lys Ser Asp Phe 

35 40 45 

Ser Gin Lys He He Ser Ser Lys Thr Asp Glu Asn Glu Gin Asp Gin 

50 55 60 

Ser Ala Ser Val Asp Met Val Phe Ser Pro Lys Tyr Asn Gin Lys Glu 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
' 840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1509 



12 



wo 01/81411 



PCT/USOl/13371 



65 










70 










75 






80 


Phe 


Gin 


Leu 


Tyr 


Ser 


Tyr Ala Cys 


Val 


Tyr 


Trp Asn Leu 


Ser 


Ala 


Lys 








85 










90 






95 




Asp 


Trp 


Asp 


Thr 


Tvr 


Gly Cys 


Gin 


Lys 


Asp 


Lys Gly Thr 


Asp Gly Phe 






100 










105 






110 






Leu Arg 


Cys 


Arg 


Cys 


noil 


His 


Thr 


Thr 


Asn 


Phe Ala Val 


Leu 


Met 


Thr 






115 










120 






125 








Phe 


Lys 
130 


Lys 


Asp 


Tyr 


Gin 


Tyr 
135 


Pro 


Lys 


Ser 


Leu Asp He 
140 


Leu 


Ser 


Asn 


Val Gly 


Cys 


Ala 


Leu 


Ser 


Val 


Thr 


Gly Leu 


Ala Leu Thr 


Val 


He 


Phe 


145 








150 










155 






1 fin 

X U u 


Gin 


He 


Val 


Thr 


Arg 
165 


Lys 


Val 


Arg 


Lys 


Thr 
170 


Ser Val Thr 


Trp 


Val 
175 


XlC Li 


Val 


Asn 


Leu 


Cys 
1 Rn 

X u u 


He 


Ser 


Met 


Leu 


He 
185 


Phe 


Asn Leu Leu 


Phe 
190 


Val 


Phe 


Gly He 


G'l n 

ox LL 


Asn 




Asn 


Lys 


Asn 


Leu 


Gin 


Thr Ser Astd 


Gly Asp 
















200 






205 








Asn 


Asn 


He 


Asp 


Phe 


Asp Asn Asn Asp 


He 


Pro Arg Thr 


Asp 


Thr 






210 










215 








220 








Asn 


He 


PT*n 

IT J. \J 


noli 


XT X U 


Met 


Cys 


Thr 


Ala 


He 


Ala Ala Leu 


Leu 


His 


lyr 


225 










230 










235 








Phe 


Leu 




Val 




Phe 


Thr Trp Asn Ala 


Leu Ser Ala 


Ala 


Gin 


Leu 




















250 






255 




Tyr 


Tyr 


Leu 


Leu 


He 


Arg 


Thr 


Met 


Lys 


Pro 


Leu Pro Arg 


His 


Phe 


He 














265 






270 






Leu 


Phe 


He 


OCX 


Jjeu 


He 


Gly Trp 


Gly Val 


Pro Ala T le 


Val 


Val 


/ixa 






^ t D 










280 






285 








He 


Thr 


Val 


Gly 


Val 


He 


Tyr 


Ser 


Gin 


Asn 


Gly Asn Asn 


Pro 


Gin 


Trp 




290 








295 








300 








Glu 


Leu 




±yr 




Gin 


Glu 


Lys 


He 


Cys 


Trn Leu Ala. 


He 


Pro 


Glu 


305 










310 










315 






320 


Pro 


Asn 


Gly 


Val 


He 


Lys 


Ser 


Pro 


Leu 


Leu 


Trp Ser Phe 


He 


Val 


Pro 
















330 






335 




Val 


Thr 


J, j,e 


xxc 


Leu 


He 


Ser 


Asn 


Val 
345 


Val 


Met Phe He 


Thr 
350 


He 


Ser 


He 


Lys 


Val 


±JC Li 


irp 


Lys 


Asn 


Asn 


Gin 


Asn 


Leu Thr Ser 


Thr Lys 


Lys 




R 










360 






365 








Val 


Ser 
370 




LMC L. 


_ 

Lys 


Lys 


He 
375 


Val 


Ser 


Thr 


T.en R^T . Val 

laJ^J> Va_l. 

380 


Ala 


Val 


Val 


Phe 


Gly 


J. Xc 




irp 


He 


Leu 


Ala 


Tyr 


Leu 


Met Leu Val 


Asn Asp Asp 


385 






390 










395 






400 


Ser 


He 


Arg 


He 


Val 


Phe 


Ser 


Tyr 


He 


Phe 


Cys Leu Phe 


Asn 


Thr 


Thr 








•i U J 










410 






415 




Gin 


Gly 


Leu 


UXil 


He 


Phe 


He 


Leu 


Tyr 


Thr 


Val Ara Thr 


Lys 


Val 


Phe 






•4 ^ VJ 










425 






430 






Gin 


Ser 


Glu 


Ala 


Ser 


Lys 


Val 


Leu 


Met 


Leu 


Leu Ser Ser 


He 


Gly Arg 






435 








440 






445 








Arg Lys 


Ser 


Leu 


Pro 


Ser 


Val 


Thr Arg 


Pro 


Arg Leu Arg 


Val 


Lys 


Met 




450 










455 








460 








Tyr Asn 


Phe 


Leu 


Arg 


Ser 


Leu 


Pro 


Thr 


Leu 


His Glu Arg 


Phe 


Arg 


Leu 


465 








470 










475 






480 


Leu 


Glu 


Thr 


Ser 


Pro 
485 


Ser 


Thr 


Glu 


Glu 


He 
490 


Thr Leu Ser 


Glu 


Ser 
495 


Asp 


Asn 


Ala 


Lys 


Glu 
500 


Ser 


He 



















<210> 15 

<211> 726 

<212> DNA 

<213> homo sapiens 

<400> 15 

atgttgattt tcaacctcct ctttgtgttt ggaattgaaa actccaataa gaacttgcag 60 
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acaagtgatg 
accattaaca 
ttagtgacat 
accatgaagc 
ccagctatag 
cagtgggaat 
ggtgttataa 
agcaatgttg 
ctgacaagca 
gttgtttttg 
aggatcgtct 
gtctga 



gtgacatcaa 
tcccgaatcc 
ttacctggaa 
ctcttcctcg 
tagtggctat 
tagactaccg 
aaagtccgct 
ttatgtttat 
caaaaaaagt 
gaattacctg 
tcagctacat 



taatattgac 
catgtgcact 
cgcactcagc 
gcatttcatt 
aacagtggga 
gcaagagaaa 
gttgtggtca 
tacaatctcg 
ttcatccatg 
gattctagca 
attctgcctt 



tttgacaata 
gcgattgccg 
gctgcacagc 
cttttcatct 
gttatttatt 
atctgctggc 
ttcatcgtac 
atcaaagtgc 
aagaagattg 
tacctgatgc 
ttcaacacta 



atgacatacc 
ccttactgca 
tctattacct 
cattaattgg 
ctcagaatgg 
tggcaattcc 
ctgtaaccat 
tgtggaagaa 
ttagcacatt 
tagttaatga 
cacaggtatg 



caggacagac 
ctattttctg 
tctaataagg 
atggggagtc 
aaataatcca 
agaacccaat 
tatcctcatc 
taaccagaac 
atctgttgca 
tgatagcatc 
gtgcggatta 



<210> 16 . 
<211> 241 
<212> 'PRT 
<213> homo sapiens 

<400> 16 

Met Leu lie Phe Asn 
- 1 5 
Lys Asn Leu Gin Thr 
20 

Asn Asn Asp lie Pro 
35 

Cys Thr Ala lie Ala 
50 

Thr Trp Asn Ala Leu 
65 

Thr Met Lys Pro Leu 
85 

Gly Trp Gly Val Pro 
100 

Tyr Ser Gin Asn Gly 
115 

Glu Lys lie Cys Trp 
130 

Ser Pro Leu Leu Trp 
145 

Ser Asn Val Val Met 
165 

Asn Asn Gin Asn Leu 
180 

lie Val Ser Thr Leu 
195 

Leu Ala Tyr Leu Met 
210 

Ser Tyr lie Phe Cys 

225 

Val 



Leu Leu Phe 

Ser Asp Gly 

Arg Thr Asp 
40 

Ala Leu Leu 
55 

Ser Ala Ala 
70 

Pro Arg His 

Ala He Val 

Asn Asn Pro 
120 

Leu Ala He 

135 
Ser Phe He 
150 

Phe He Thr 

Thr Ser Thr 

Ser Val Ala 
200 

Leu Val Asn 

215 
Leu Phe Asn 
230 



Val Phe 
10 

Asp He 
25 

Thr He 

His Tyr 

Gin Leu 

Phe He 
90 

Val Ala 
105 

Gin Trp 

Pro Glu 

Val Pro 

He Ser 
170 
Lys Lys 
185 

Val Val 
Asp Asp 
Thr Thr 



Gly He Glu Asn 
Asn Asn 
Asn He 



Phe Leu 

60 
Tyr Tyr 
75 

Leu Phe 

He Thr 

Glu Leu 

Pro Asn 
140 
Val Thr 
155 

He Lys 

Val Ser 

Phe Gly 

Ser He 
220 
Gin Val 
235 



He Asp 

30 
Pro Asn 
45 

Leu Val 



Leu Leu 

He Ser 

Val Gly 
110. 
Asp Tyr 
125 

Gly Val 

He He 

Val Leu 

Ser Met 
190 
He Thr 
205 

Arg He 
Trp Cys 



Ser Asn 
15 

Phe Asp 

Pro Met 

Thr Phe 

He Arg 

80 
Leu He 
95 

Val He 

Arg Gin 

He Lys 

Leu He 
160 
Trp Lys 
175 

Lys Lys 

Trp He 

Val Phe 

Gly Leu 
240 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
726 



<210> 17 

<211> 963 

<212> DNA 

<213> homo sapiens 



<400> 17 

atgttgattt 

acaagtgatg 

accattaaca 

ttagtgacat 

accatgaagc 



tcaacctcct 
gtgacatcaa 
tcccgaatcc 
ttacctggaa 
ctcttcctcg 



ctttgtgttt 
taatattgac 
catgtgcact 
cgcactcagc 
gcatttcatt 



ggaattgaaa 
tttgacaata 
gcgattgccg 
gctgcacagc 
cttttcatct 



actccaataa 
atgacatacc 
ccttactgca 
tctattacct 
cattaattgg 



gaacttgcag 
caggacagac 
ctattttctg 
tctaataagg 
atggggagtc 



60 
120 
180 
240 
300 
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ccagctatag tagtggctat aacagtggga gttatttatt ctcagaatgg aaataatcca 360 

cagtgggaat tagactaccg gcaagagaaa atctgctggc tggcaattcc agaacccaat 420 

ggtgttataa aaagtccgct gttgtggtca ttcatcgtac ctgtaaccat tatcctcatc 480 

agcaatgttg ttatgtttat tacaatctcg atcaaagtgc tgtggaagaa taaccagaac 540 

ctgacaagca caaaaaaagt ttcatccatg aagaagattg ttagcacatt atctgttgca 600 

gttgtttttg gaattacctg gattctagca tacctgatgc tagttaatga tgatagcatc 660 

aggatcgtct tcagctacat attctgcctt ttcaacacta cacagggatt gcaaattttt 720 

atcctgtaca ctgttagaac aaaagtcttc cagagtgaag cttccaaagt gttgatgttg 780 

ctatcgtcta ttgggagaag gaagtcattg ccttcagtga cgcggccgag gctgcgtgta 840 

aagatgtata atttcctcag gtcattgcca accttacatg aacgctttag gctactggaa 900 

acctctccga gtactgagga aatcacactc tctgaaagtg acaatgcaaa ggaaagcatc . 960 

tag 963 

<210> 18 
<211> 320 
<212> PRT 

<213> homo sapiens 
<400> 18 

Met Leu lie Phe Asn Leu Leu Phe Val Phe Gly He Glu Asn Ser Asn 

15 10 15 

Lys Asn Leu Gin Thr Ser Asp Gly Asp He Asn Asn He Asp Phe Asp 

20 25 30 

Asn Asn Asp He Pro Arg Thr Asp Thr He Asn He Pro Asn Pro Met 

35 40 45 

Cys Thr Ala He Ala Ala Leu Leu His Tyr Phe Leu Leu Val Thr Phe 

50 55 60 

Thr Trp Asn Ala Leu Ser Ala Ala Gin Leu Tyr Tyr Leu Leu He Arg 
65 70 75 80 

Thr Met Lys Pro Leu Pro Arg His Phe He Leu Phe He Ser Leu He 

85 90 95 

Gly Trp Gly Val Pro Ala He Val Val Ala He Thr Val Gly Val He 

100 105 110 

Tyr Ser Gin Asn Gly Asn Asn Pro Gin Trp Glu Leu Asp Tyr Arg Gin 

115 120 125 

Glu Lys He Cys Trp Leu Ala He Pro Glu Pro Asn Gly Val He Lys 

130 135 140 

Ser Pro Leu Leu Trp Ser Phe He Val Pro Val Thr He He Leu He 
145 150 155 160 

Ser Asn Val Val Met Phe He Thr He Ser He Lys Val Leu Trp Lys 

165 170 175 

Asn Asn Gin Asn Leu Thr Ser Thr Lys Lys Val Ser Ser Met Lys Lys 

180 185 190 

He Val Ser Thr Leu Ser Val Ala Val Val Phe Gly He Thr Trp He 

195 200 205 

Leu Ala Tyr Leu Met Leu Val Asn Asp Asp Ser He Arg He Val Phe 

210 215 220 

Ser Tyr He Phe Cys Leu Phe Asn Thr Thr Gin Gly Leu Gin He Phe 
225 230 235 240 

He Leu Tyr Thr Val Arg Thr Lys Val Phe Gin Ser Glu Ala Ser Lys 

245 250 255 

Val Leu Met Leu Leu Ser Ser He Gly Arg Arg Lys Ser Leu Pro Ser 

260 265 270 

Val Thr Arg Pro Arg Leu Arg Val Lys Met Tyr Asn Phe Leu Arg Ser 

275 280 285 

Leu Pro Thr Leu His Glu Arg Phe Arg Leu Leu Glu Thr Ser Pro Ser 

290 295 300 

Thr Glu Glu He Thr Leu Ser Glu Ser Asp Asn Ala Lys Glu Ser He 
305 310 315 320 

<210> 19 
<211> 3057 
<212> DNA 
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<213> homo sapiens 
<400> 19 

ggcggcggcc gcggcctccg caagccgtct ttctctagag ttgtatatat agaacatcct ^9- 

ggagtccacc atgaacggac agttggatct aagtgggaag ctaatcatca aagctcaact 120 

tggggaggat attcggcgaa ttcctattca taatgaagat attacttatg atgaattagt * 180 

gctaatgatg caacgagttt tcagaggaaa acttctgagt aatgatgaag taacaataaa 240 

gtataaagat gaagatggag atcttataac aatttttgat agttctgacc tttcctttgc 300 

aattcagtgc agtaggatac tgaaactgac attatttgga aaatctactt cctcatcaag 360 

cacccctaca gagttctgca ggaatggtgg aacctgggaa aatggcagat gtatttgtac 420 

agaagagtgg aaaggactga gatgtacaat tgctaatttt tgtgaaaata gtacctatat 480 

gggttttact tttgccagaa tcccagtggg cagatatgga ccatccttgc aaacatgtgg 540 

caaggatact ccaaatgcgg gcaatccaat ggcagtccgg ttgtgcagtc tctctctata 600 

tggagagata gaattacaaa aagtgacaat aggaaattgc aatgaaaatc tggaaaccct 660 

ggaaaagcag gtaaaggatg tcacagcacc acttaataac atttcttctg aagtccagat 720 

tttaacatct gatgccaata aattaactgc tgagaacatc actagtgcta cgcgagtggt 780 

tggacagata ttcaacactt ccagaaatgc ttcacctgag gcaaagaaag ttgccatagt 84 0 

aacagtgagt caactcctag atgccagtga agatgctttt caaagagttg ctgctactgc 900 

taatgatgat gcccttacaa cgcttattga gcaaatggag acttattcct tgtctttggg 960 

taatcaatca gtggtggaac ctaacatagc aatacagtca gcaaatttct cttcagaaaa 1020 

tgcggtgggg ccttcaaatg ttcgcttctc tgtgcagaaa ggagctagca gttctctagt ' 1080 

ttctagttca acatttatac atacaaatgt ggatggcctt aacccagatg cacagactga 1140 

gcttcaggtc ttgcttaata tgacgaaaaa ttacaccaag acatgcggct ttgtagttta 1200 

tcaaaatgac aagcttttcc aatcaaaaac ttttacagct aaatcggatt ttagtcaaaa 1260 

aattatctca agcaaaactg atgaaaatga gcaagatcag agtgcttctg ttgacatggt 1320 

ctttagtcca aagtacaacc aaaaagaatt tcaactctat tcctatgcct gtgtctattg 1380 

gaatttgtca gcgaaggact gggacacata tggctgtcaa aaagacaagg gcactgatgg 14 4 0 

attcctgcgc tgccgctgca accatactac taattttgct gtattaatga ctttcaaaaa 1500 

ggattatcaa tatcccaaat cacttgacat attatccaac gttggatgtg cactgtctgt 1560 

tactggtctg gctctcacag ttatatttca gattgtcacc aggaaagtca gaaaaacctc 1620 

agtaacctgg gttttggtca atctgtgcat atcaatgttg attttcaacc tcctctttgt 1680 

gtttggaatt gaaaactcca ataagaactt gcagacaagt gatggtgaca tcaataatat 1740 

tgactttgac aataatgaca tacccaggac agacaccatt aacatcccga atcccatgtg 1800 

cactgcgatt gccgccttac tgcactattt tctgttagtg acatttacct ggaacgcact 1860 

cagcgctgca cagctctatt accttctaat aaggaccatg aagcctcttc ctcggcattt 1920 

cattcttttc atctcattaa ttggatgggg agtcccagct atagtagtgg ctataacagt 1980 

gggagttatt tattctcaga atggaaataa tccacagtgg gaattagact accggcaaga 2040 

gaaaatctgc tggctggcaa ttccagaacc caatggtgtt ataaaaagtc cgctgttgtg 2100 

gtcattcatc gtacctgtaa ccattatcct catcagcaat gttgttatgt ttattacaat 2160 

ctcgatcaaa gtgctgtgga agaataacca gaacctgaca agcacaaaaa aagtttcatc 2220 

catgaagaag attgttagca cattatctgt tgcagttgtt tttggaatta cctggattct 2280 

agcatacctg atgctagtta atgatgatag catcaggatc gtcttcagct acatattctg 2340 

ccttttcaac actacacagg gattgcaaat ttttatcctg tacactgtta gaacaaaagt 2400 

cttccagagt gaagcttcca aagtgttgat gttgctatcg tctattggga gaaggaagtc 2460 

attgccttca gtgacgcggc cgaggctgcg tgtaaagatg tataatttcc tcaggtcatt 2520 

gccaacctta catgaacgct ttaggctact ggaaacctct ccgagtactg aggaaatcac 2580 

actctctgaa agtgacaatg caaaggaaag catctagaca gtaaaactta cctgttgtgg 2640 

tctttttaat cacctcgttt gagttttatc tgtttctctc ctttatttcc cagtcctctc 2700 

agaaagtctt cctcaatgta ttttgctcag gattaagaat tagataaaac ctgttgttta 2760 

ttattattcg gcataatgga , cttggtagtt tttctatttt tcaatagatt tgtacttgaa 2820 

taaggtgaag aatttcacac aacatacaag agtaccattg ttccttatat cgttaaatct 2880 

ttgtgacaca ctttgacaaa aatgtagaac ctataacaaa ttcttttaca agttactata 2940 

aaggacacaa agagaaaact ttaccttcca gaacaaaatg actcctgatg aacagtgtgt 3000 

ggggatttgc ttgtatgtat taaacttttg acctctgaaa aaaaaaaaaa aaaaaaa 3057 
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